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Introduction 


Since its release in 1979, Oracle Database has become the leading relational 
database management system in the world. Today, Oracle has a huge base of 
customers and continues to dominate the database market, especially for mis- 
sion-critical enterprise systems. In short, Oracle Database and Oracle database 
programmers are here to stay. 

Over the years, Oracle Database has gained a reputation for being expensive 
and difficult to use. Today, however, Oracle offers the Express Edition of Oracle 
Database as a free download. Oracle also offers a free product called SQL 
Developer that makes it easier to work with an Oracle database. And now, this 
book takes advantage of these products to make it easier than ever to master the 
SQL and PL/SQL skills that you need to work with an Oracle database. 


Who this book is for 


This book is designed for application developers who need to work with an 
Oracle database. It shows how to code all the SQL statements that developers 
need for their applications. It shows how to code these statements so they run 
efficiently. And it works for users of any release of Oracle Database through 10g 
and 11g. 

This book is also a good choice for anyone who wants to learn standard 
SQL. Since SQL is a standard language for accessing database data, most of the 
SQL code in this book will work with any database management system. As a 
result, once you use this book to learn how to use SQL to work with an Oracle 
database, you can transfer most of what you have learned to another database 
management system such as SQL Server or DB2. 

This book is also the right first book for anyone who wants to become a 
database administrator. Although this book doesn’t present the advanced skills 
that are needed by a DBA, it will get you started in that direction. Then, when 
you complete this book, you’ll be prepared for more advanced books on the 
subject. 
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5 reasons why you'll learn faster with this book 


Unlike most Oracle books, this one starts by showing you how to query an 
existing database rather than how to create a new database. Why? Because 
that’s what you’re most likely to need to do first on the job. Once you master 
those skills, you can learn how to design and implement a database if you 
need to do that. Or, you can learn how to work with other database features 
like transactions or stored procedures if you need to do that. 


Unlike most Oracle books, this one shows how to use Oracle SQL Developer 
to enter and run your SQL statements. SQL Developer is a graphical tool 
that’s an intuitive and user-friendly replacement for SQL*Plus, an arcane 
command prompt tool that has been around since the early days of Oracle. As 
a result, using SQL Developer instead of SQL*Plus will help you learn more 
quickly. 


Like all our books, this one includes hundreds of examples that range from 
the simple to the complex. That way, you can quickly get the idea of how a 
feature works from the simple examples, but you’ll also see how the feature is 
used in the real world from the complex examples. 


Like most of our books, this one has exercises at the end of each chapter that 
give you hands-on experience by letting you practice what you’ve learned. 
These exercises also encourage you to experiment and to apply what you’ve 
learned in new ways. 


If you page through this book, you’ll see that all of the information is pre- 
sented in “paired pages,” with the essential syntax, examples, and guidelines 
on the right page and the perspective and extra explanation on the left page. 
This helps you learn more with less reading, and it is the ideal reference 
format when you need to refresh your memory about how to do something. 


What you'll learn in this book 


In section 1, you’ll learn the concepts and terms you need for working with 
any database. You'll also learn how to use Oracle Database and Oracle SQL 
Developer to run SQL statements on your own computer. 


In section 2, you'll learn all the SQL skills for retrieving data from a database 
and for adding, updating, and deleting that data. These skills move from the 
simple to the complex so you won’t have any trouble if you’re new to SQL. 
But these skills are also sure to raise your expertise even if you already have 
SQL experience. 


In section 3, you’ll learn how to design a database and how to implement that 
design by using the DDL (Data Definition Language) statements that are a 
part of SQL. When you’re done, you'll be able to design and implement your 


Introduction XVII 


own database. In addition, you’ll gain valuable perspective that will make 
you a better SQL programmer, even if you never have to design a database. 


e Insection 4, you'll learn how to use Oracle’s procedure language, PL/SQL, 
to create stored procedures, functions, and triggers. In addition, you’ll learn 
how to manage transactions and locking. Once you master these features, 
you'll have a powerful set of PL/SQL skills. 


e In section 5, you’ll learn how to work with the timestamp, interval, and large 
object data types. These data types became available with releases 9i and 8, 
and they provide features that are critical for storing data in today’s global 
and digital world. 


What software you need for this book 


To run SQL statements with an Oracle database using the techniques in this 
book, we recommend that you use: 


e The Express Edition of Oracle Database 10g or 11g 
e Oracle SQL Developer 


Both of these products can be downloaded for free from Oracle’s web site. And 
appendix A of this book provides complete instructions for installing them. 

If you want to use another edition of Oracle Database, you can still use the 
techniques described in this book. However, you may need to install that edition, 
and you may need to use a different procedure to connect SQL Developer to it. 
That’s why appendix B shows you how to do both. 

Unfortunately, you can’t use SQL Developer to connect to versions of Oracle 
Database that are prior to version 9.2.0.1. So if your company is using an earlier 
version, one alternative is to use SQL*Plus to work with it as described in chapter 
2. A better training alternative, though, is to download and install the Express 
Edition of Oracle Database and SQL Developer as described in appendix A. 
Then, when you’re through training, you can use SQL*Plus or a commercial 
product like Toad to work with the version of Oracle Database that your company 
uses. 


What you can download from our web site 


You can download all the source code for this book from our web site. That 
includes: 


e The script files that create and populate the database tables for the three 
schemas used by this book. 


e The SQL, PL/SQL, and Java source code for all of the examples in this book 


Here again, appendix A provides complete instructions for downloading and 
installing these items on your computer. 


XVIii Introduction 


If you’re a corporate trainer or a college instructor who would like to use this 
book for a course, we offer an Instructor’s CD that includes: (1) instructional 
objectives that describe the skills a student should have upon completion of each 
chapter; (2) the solutions to the book exercises; (3) completion and multiple- 
choice tests that measure mastery of the skills described in the objectives; and 
(4) PowerPoint slides that you can use to review and reinforce the content that’s 
presented in the figures of the book. 

To learn more about this Instructor’s CD and to find out how to get it, please 
go to our web site at www.murach.com and click on the Trainers link or the 
Instructors link. Or, if you prefer, you can call Kelly at 1-800-221-5528 or send 


an email to kelly@murach.com. 
Please let us know how this book works for you 


My goals when I started this book were (1) to provide an Oracle book for 
application developers that will help them work more effectively; (2) to cover the 
database design and implementation skills that application developers are most 
likely to use; and (3) to do both in a way that helps you learn faster and better 
than you can with any other book. Now, I sincerely hope that I have succeeded. 

If you have any comments about this book, I would appreciate hearing from 
you. Good luck with your Oracle projects! 


Joel Murach, Author 
joelmurach @ yahoo.com 


Section 1 


An introduction to SQL 


Before you begin to learn the fundamentals of programming in SQL, you need to 
understand some concepts and terms related to SQL and relational databases. 
That’s what you’ll learn in chapter 1. Then, in chapter 2, you’ll learn about some 
of the tools you can use to work with an Oracle database. At that point, you’ll 
have all of the background and skills that you need to work with the rest of this 
book. 
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An introduction to relational 
databases and SQL 


This chapter presents the concepts and terms that you should understand before 
you begin learning how to use SQL to work with an Oracle database. Although 
this chapter doesn’t present the coding details, it does present an overview of 
the most important types of SQL statements that are presented in this book. 
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Section 1 An introduction to SQL 


An introduction to client/server 
systems 


In case you aren’t familiar with client/server systems, the topics that follow 
introduce you to their essential hardware and software components. When you 
use SQL to access an Oracle database, that system is often a client/server 
system. 


The hardware components of a client/server 
system 


Figure 1-1 presents the three hardware components of a client/server 
system: the clients, the network, and the server. The clients are usually the PCs 
that are already available on the desktops throughout a company. And the 
network is the cabling, communication lines, network interface cards, hubs, 
routers, and other components that connect the clients and the server. 

The server, commonly referred to as a database server, is a computer that 
has enough processor speed, internal memory (RAM), and disk storage to store 
the files and databases of the system and provide services to the clients of the 
system. This computer can be a high-powered PC, a midrange system like an 
AS/400 or Unix system, or even a mainframe system. When a system consists 
of networks, midrange systems, and mainframe systems, often spread through- 
out the country or world, it is commonly referred to as an enterprise system. 

To back up the files of a client/server system, a Server usually has a tape 
drive or some other form of offline storage. It often has one or more printers or 
specialized devices that can be shared by the users of the system. And it can 
provide programs or services like e-mail that can be accessed by all the users of 
the system. 

In a simple client/server system, the clients and the server are part of a local 
area network (LAN). However, two or more LANs that reside at separate 
geographical locations can be connected as part of a larger network such as a 
wide area network (WAN). In addition, individual systems or networks can be 
connected over the Internet. 
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A simple client/server system 
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The three hardware components of a client/server system 


The clients are the PCs, Macintoshes, or workstations of the system. 

The server is a computer that stores the files and databases of the system and provides 
services to the clients. When it stores databases, it’s often referred to as a database 
server, 

The network consists of the cabling, communication lines, and other components that 
connect the clients and the servers of the system. 


Client/server system implementations 


In a simple client/server system like the one above, the server is typically a high-powered 
PC that communicates with the clients over a local area network (LAN). 


The server can also be a midrange system, like an AS/400 or a Unix system, or it can be 
a mainframe system. Then, special hardware and software components are required to 
make it possible for the clients to communicate with the midrange and mainframe 
systems. 

A client/server system can also consist of one or more PC-based systems, one or more 
midrange systems, and a mainframe system in dispersed geographical locations. This 
type of system is commonly referred to as an enterprise system. 

Individual systems and LANs can be connected and share data over larger private 
networks, such as a wide area network (WAN), or a public network like the Internet. 


Figure 1-1 The hardware components of a client/server system 
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The software components of a client/server 
system 


Figure 1-2 presents the software components of a typical client/server 
application. Here, the server requires a database management system (DBMS) 
like Oracle or Microsoft SQL Server. This DBMS manages the databases that 
are stored on the server. 

In contrast to a server, each client requires application software to perform 
useful work. This can be a purchased software package like a financial account- 
ing package, or it can be custom software that’s developed for a specific appli- 
cation. 

Although the application software is run on the client, it uses data that’s 
stored on the server. To do that, it uses a data access API (application program- 
ming interface) such as JDBC. Since the technique you use to work with an API 
depends on the programming language and API you’re using, you won’t learn 
those techniques in this book. Instead, you’ll learn about a standard language 
called SQL (Structured Query Language) that lets any application communicate 
with any DBMS. (In conversation, SQL is pronounced as either S-Q-L or 
sequel.) 

Once the software for both client and server is installed, the client commu- 
nicates with the server via SQL queries (or just queries) that are passed to the 
DBMS through the API. After the client sends a query to the DBMS, the DBMS 
interprets the query and sends the results back to the client. 

In a client/server system, the processing is divided between the clients and 
the server. In this figure, for example, the DBMS on the server processes the 
requests that are made by the application running on the client. Theoretically, at 
least, this balances the workload between the clients and the server so the 
system works more efficiently. 
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Client software, server software, and the SQL interface 


SQL queries 
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Server software 


e To store and manage the databases of the client/server system, each server requires a 
database management system (DBMS) like Oracle. 


e The processing that’s done by the DBMS is typically referred to as back-end processing, 
and the database server is referred to as the back end. 


Client software 
e The application software does the work that the user wants to do. This type of software 
can be purchased or developed. 


e The data access API (application programming interface) provides the interface between 
the application program and the DBMS. The most common data access API for Oracle is 
JDBC (Java Database Connectivity). 

e The processing that’s done by the client software is typically referred to as front-end 
processing, and the client is typically referred to as the front end. 


The SQL interface 

e The application software communicates with the DBMS by sending SQL queries 
through the data access API. When the DBMS receives a query, it provides a service like 
returning the requested data (the query results) to the client. 

e SQL stands for Structured Query Language, which is the standard language for working 
with a relational database. 


Client/server versus file-handling systems 


e Ina client/server system, the processing done by an application is typically divided 
between the client and the server. 

e Ina file-handling system, all of the processing is done on the clients. Although the 
clients may access data that’s stored in files on the server, none of the processing is done 
by the server. As a result, a file-handling system isn’t a client/server system. 


Figure 1-2 The software components of a client/server system 
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Other client/server architectures 


In its simplest form, a client/server system consists of a single database 
server and one or more clients. Many client/server systems today, though, 
include additional servers. For example, figure 1-3 shows two client/server 
systems that include an additional server between the clients and the database 
server. 

The first illustration is for a simple Windows-based system. With this 
system, only the user interface for an application runs on the client. The rest of 
the processing that’s done by the application is stored in one or more business 
components on the application server. Then, the client sends requests to the 
application server for processing. If the request involves accessing data in a 
database, the application server formulates the appropriate query and passes it 
on to the database server. The results of the query are then sent back to the 
application server, which processes the results and sends the appropriate re- 
sponse back to the client. 

Similar processing is done by a web-based system, as illustrated by the 
second example in this figure. In this case, though, a web browser running on 
the client is used to send requests to a web application running on a web server 
somewhere on the Internet. The web application, in turn, can use web services 
to perform some of its processing. Then, the web application or web service can 
pass requests for data on to the database server. 

Although this figure should give you an idea of how client/server systems 
can be configured, you should realize that they can be much more complicated 
than what’s shown here. For example, business components can be distributed 
over any number of application servers, and those components can communi- 
cate with databases on any number of database servers. Similarly, the web 
applications and services in a web-based system can be distributed over numer- 
ous web servers that access numerous database servers. In most cases, though, 
you don’t need to know how a system is configured to use SQL. 

Before I go on, you should know that client/server systems aren’t the only 
systems that support SQL. For example, traditional mainframe systems and 
newer thin client systems also use SQL. Unlike client/server systems, though, 
most of the processing for these types of systems is done by a mainframe or 
another high-powered machine. The terminals or PCs that are connected to the 
system do little or no work. 
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An application that uses an application server 
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Description 


e In addition to a database server and clients, a client-server system can include additional 
servers, such as application servers and web servers. 


e Application servers are typically used to store business components that do part of the 
processing of the application. In particular, these components are used to process data- 
base requests from the user interface running on the client. 


e Web Servers are typically used to store web applications and web services. Web applica- 
tions are applications that are designed to run on a web server. Web services are like 
business components, except that, like web applications, they are designed to run on a 
web server. 


e Ina web-based system, a web browser running on a client sends a request to a web 
server over the Internet. Then, the web server processes the request and passes any 
requests for data on to the database server. 


e More complex system architectures can include two or more application servers, web 
servers, and database servers. 


Figure 1-3 Other client/server architectures 
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An introduction to the relational data- 
base model 


In 1970, Dr. E. F. Codd developed a model for a new type of database called 
a relational database. This type of database eliminated some of the problems 
that were associated with standard files and other database designs. By using the 
relational model, you can reduce data redundancy, which saves disk storage and 
leads to efficient data retrieval. You can also view and manipulate data in a way 
that is both intuitive and efficient. Today, relational databases are the de facto 
standard for database applications. 


The model for a relational database states that data is stored in one or more 
tables. It also states that each table can be viewed as a two-dimensional matrix 
consisting of rows and columns. This is illustrated by the relational table in 
figure 1-4. Each row in this table contains information about a single vendor. 

In practice, the rows and columns of a relational database table are often 
referred to by the more traditional terms, records and fields. In fact, some 
software packages use one set of terms, some use the other, and some use a 
combination. In this book, I use the terms rows and columns because those are 
the terms used by Oracle. 

In general, each table is modeled after a real-world entity such as a vendor 
or an invoice. Then, the columns of the table represent the attributes of the 
entity such as name, address, and phone number. And each row of the table 
represents one instance of the entity. A value is stored at the intersection of each 
row and column, sometimes called a cell. 

If a table contains one or more columns that uniquely identify each row in 
the table, you can define these columns as the primary key of the table. For 
instance, the primary key of the Vendors table in this figure is the vendor_id 
column. In this example, the primary key consists of a single column. However, 
a primary key can also consist of two or more columns, in which case it’s called 
a composite primary key. 

In addition to primary keys, some database management systems let you 
define additional keys that uniquely identify each row in a table. If, for example, 
the vendor_name column in the Vendors table contains unique data, it can be 
defined as a non-primary key. In Oracle, this is called a unique key. 

Indexes provide an efficient way of accessing the rows in a table based on 
the values in one or more columns. Because applications typically access the 
rows in a table by referring to their key values, an index is automatically created 
for each key you define. However, you can define indexes for other columns as 
well. If, for example, you frequently need to sort the Vendor rows by zip code, 
you can set up an index for that column. Like a key, an index can include one or 
more columns. 
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The Vendors table in an Accounts Payable database 


Primary key 


Columns 


VENDOR_ID |D venor_nawe |A VENDOR_ADDRESS1 a YENDOR_ADDRESS2 WENDOR_CIT 
1 1US Postal Service Attn: Supt. Window Ser... PO Box 7005 Madison 
2 2 National Informatio... PO Box 96621 NULL Washington 
3 3 Register of Copyrig... Library Of Congress NULL Washington 
4 4 Jobtrak 1990 Westwood Blvd St... NULL Los Angeles 
5 5 Newbrige Book Clu... 3000 Cindel Drive NULL Washington 
6 6 California Chamber... 3255 Ramos Cir NULL Sacramento 
7 7 Towne Advertiser’... Kevin Minder 3441 WY Macarthur Blvd Santa Ana Rows 
8 8 BFI Industries PO Box 9369 NULL Fresno 
3 9 Pacific Gas & Elect... Box 52001 NULL San Francisco 
10 10 Robbins Mobile Loc... 4669 N Fresno NULL Fresno 
11 11 Bill Marvin Electric l... 4563 E Home NULL Fresno 
12 12 City Of Frasno PO Box 2069 NULL Frasno 
13 13 Golden Eagle Insur... PO Box 85826 NULL San Diego 
14 Expedata Inc 4420 N. First Street, Sult... NULL Fresno 


Concepts 


e A relational database consists of tables. Tables consist of rows and columns, which can 
also be referred to as records and fields. 


e A table is typically modeled after a real-world entity, such as an invoice or a vendor. 


e Acolumn represents some attribute of the entity, such as the amount of an invoice or a 
vendor’s address. 


e A row contains a set of values for a single instance of the entity, such as one invoice or 


one vendor. 


e The intersection of a row and a column is sometimes called a cell. A cell stores a single 


value. 


e Most tables have a primary key that uniquely identifies each row in the table. The 


primary key is usually a single column, but it can also consist of two or more columns. If 
a primary key uses two or more columns, it’s called a composite primary key. 


e In addition to primary keys, some database management systems let you define one or 
more non-primary keys. In Oracle, these keys are called unique keys. Like a primary key, 
a non-primary key uniquely identifies each row in the table. 


e A table can also be defined with one or more indexes. An index provides an efficient way 
to access data from a table based on the values in specific columns. An index is auto- 
matically created for a table’s primary and non-primary keys. 


Figure 1-4 How a database table is organized 
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How tables are related 


The tables in a database can be related to other tables by values in specific 
columns. The two tables shown in figure 1-5 illustrate this concept. Here, each 
row in the Vendors table is related to one or more rows in the Invoices table. 
This is called a one-to-many relationship. 

Typically, relationships exist between the primary key in one table and the 
foreign key in another table. The foreign key is simply one or more columns ina 
table that refer to a primary key in another table. In this figure, for example, the 
vendor_id column is the foreign key in the Invoices table and is used to create 
the relationship between the Vendors table and the Invoices table. 

When you define a foreign key for a table in Oracle, you can’t add rows to 
the table with the foreign key unless there’s a matching primary key in the 
related table. For example, if you try to add a row to the Invoices table with a 
vendor_id value that doesn’t exist in the Vendors table, Oracle won’t add the 
row and will display an error. This helps to maintain the integrity of the data 
that’s stored in the database. 

Although one-to-many relationships are the most common, two tables can 
also have a one-to-one or many-to-many relationship. If a table has a one-to-one 
relationship with another table, the data in the two tables could be stored in a 
single table. Because of that, one-to-one relationships are used infrequently. 

In contrast, a many-to-many relationship is usually implemented by using 
an intermediate table that has a one-to-many relationship with the two tables in 
the many-to-many relationship. In other words, a many-to-many relationship 
can usually be broken down into two one-to-many relationships. 
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The relationship between the Vendors and Invoices tables in the database 


Primary key 


VENDOR_ID |] VENDOR Name |} VENDOR _ADDRESS1 |g VENDOR_ADDRESS2 |Í] VENDOR_cIT 


113 114 Postmaster Postage Due Technician 1900 E Street Fresno 

114 115 Roadway Package... Dept La 21095 NULL Pasadena 
115 116 State of California Employment Developmen... PO Box 826276 Sacramento 
116 117 Suburban Propane 28745 Cherry Ave NULL Fresno 

117 118 Unocal P.O. Box 860070 NULL Pasadena 
118 119 Yesmed, Inc PO Box 2061 NULL Fresno 

119 120 DetaformséVest 1617 W. Shaw Avenue Sulte F Fresno 

120 121 Zylka Design 3467 W Shaw Ave #103 NULL Fresno 

122 United Parcel Servi... P.O. Box 505620 NULL Reno 


30 
32 
36 
37 


Concepts 


123 4-31 4-3057 02-MA -08 13.75 
30 94 203339-13 02-MA r -08 175 
123 2-000-2993 03-MAY -08 144.7 
32 89 125520-1 05-MAY -08 95 


34 
36 
37 


123 Federal Express C... P.O. Box 1140 


123 1-202-2978 
110 0-2436 
123 1-200-5164 
110 0-2060 
110 0-2058 
123 963253272 


Foreign key 


Dept A 


INVOICED |f) venoor p |® nvoce numser A) NvocE pate INVOICE_TOTAL | 


06-MAY-08 
O7-MAY-08 
O7-MAY-08 
08-MAY -08 
08-MAY -08 
09-MAY -08 


Memphis 


33 res 
40976.08| 
63.4 
2351 7.58 
37966.19 


e The vendor_id column in the Invoices table is called a foreign key because it identifies a 
related row in the Vendors table. A table may contain one or more foreign keys. 


e When you define a foreign key for a table in Oracle, you can’t add rows to the table with 
the foreign key unless there’s a matching primary key in the related table. 


e The most common type of relationship is a one-to-many relationship as illustrated by the 
Vendors and Invoices tables. A table can also have a one-to-one relationship or a many- 


to-many relationship with another table. 


Figure 1-5 


How the tables in a relational database are related 
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How columns in a table are defined 


When you define a column in a table, you assign properties to it as indicated 
by the design of the Invoices table in figure 1-6. The most critical property for a 
column is its data type, which determines the type of information that can be 
stored in the column. With Oracle, you can choose from the data types listed in 
this figure as well as several other data types that are described in chapter 8. As 
you define each column in a table, you generally try to assign the data type that 
will minimize the use of disk storage because that will improve the performance 
of the queries later. 

In addition to a data type, you must identify whether the column can store a 
null value (or just null). A null represents a value that’s unknown, unavailable, 
or not applicable. If you don’t allow null values for a column, you must provide 
a value for that column when you store a new row in the table. 

You can also assign a default value to each column. Then, that value is 
assigned to the column if another value isn’t provided. You'll learn more about 
how to work with nulls and default values later in this book. 
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The columns of the Invoices table 

i Oracle SQI Developer, A : i y masi 
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B W Columns Data | Constraints | Grants | Statistics | Triggers |Dependencies | Details |indexes IS@L 


E f connections # a] Actions... 

Column Name  |@ Deta Type Nulabie Data Defaut |B) COLUMN ID |E Primary key A COMMENTS 
INVOICE _[D NUMBER No (nul) 4 (null) 

E GENERAL _LEDGER_ACCOUNTS let oy ee F ; R 

E E INVOICE_ARCHIVE J 5 nu Chul (null) 

4) EB INVOICE LINE ITEMS INYVOICE_NUMBER YARCHAR2(50 BYTE) No (nul {null} (null) 
INVOICE_DATE DATE No (nul (null) (ruil) 

o E terms INVOICE_TOTAL NUMBER(3,2) No (null (null) (null) 

a — VENDOR_CONTACTS PAYMENT_TOTAL NUMBER(S,2) Yes (null) (null) 

eee eS CREDIT_TOTAL NUMBER(8,2) Yes Chul) (nuly 


& 168 views 
$ Gd indexes TERMS_ID NUMBER No (null) (null) 
{null} (null) 


H (BR Packages INVOICE_DUE_DATE DATE No 

A U9 Procedures PAYMENT_DATE DATE Yee {null} (null) 
E A Functions 

@ UW Queues 

H OA Queues Tables 

A Triggers 

G [B types 

@ a Sequences 

H UA Materialized Views < 


LEP) Materialized Views Logs lsa History 


) 
) 
) 
) 


owomonomnruon = 


= 


Common Oracle data types 
Description 
CHAR, VARCHAR2 A string of letters, symbols, and numbers in the ASCII character set. 
NUMBER Integer and decimal numbers that contain an exact value. 


FLOAT Floating-point numbers that contain an approximate value. 
DATE Dates and times. 


Description 


e The data type that’s assigned to a column determines the type of information that can be 
stored in the column. 


e Each column definition also indicates whether or not it can contain null values. A null 
value indicates that the value of the column is unknown. 


e Acolumn can also be defined with a default value. Then, that value is used if another 
value isn’t provided when a row is added to the table. 


Figure 1-6 How the columns in a table are defined 
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An introduction to SQL and SQL-based 
systems 


In the topics that follow, you’ll learn how SQL and SQL-based database 
management systems evolved. In addition, you’ll learn how some of the most 
popular SQL-based systems compare. 


A brief history of SQL 


Prior to the release of the first relational database management system, each 
database had a unique physical structure and a unique programming language 
that the programmer had to understand. That all changed with the advent of 
SQL and the relational database management system. 

Figure 1-7 lists the important events in the history of SQL. In 1970, Dr. E. 
F. Codd published an article that described the relational database model he had 
been working on with a research team at IBM. By 1978, the IBM team had 
developed a database system based on this model, called System/R, along with a 
query language called SEQUEL (Structured English Query Language). Al- 
though the database and query language were never officially released, IBM 
remained committed to the relational model. 

The following year, Relational Software, Inc. released the first relational 
database management system, called Oracle. This RDBMS ran on a minicom- 
puter and used SQL as its query language. This product was widely successful, 
and the company later changed its name to Oracle to reflect that success. 

In 1982, IBM released its first commercial SQL-based RDBMS, called 
SQL/DS (SQL/Data System). This was followed in 1985 by DB2 (Database 2). 
Both systems ran only on IBM mainframe computers. Later, DB2 was ported to 
other operating systems, including Unix and Windows. Today, it continues to be 
IBM’s premier database system. 

During the 1980s, other SQL-based database systems, including SQL 
Server, were developed. Although each of these systems used SQL as its query 
language, each implementation was unique. That began to change in 1989, 
when the American National Standards Institute (ANSI) published its first set of 
standards for a database query language. These standards have been revised a 
few times since then, most recently in 2003. As each database manufacturer has 
attempted to comply with these standards, their implementations of SQL have 
become more similar. However, each still has its own dialect of SQL that 
includes additions, or extensions, to the standards. 

Although you should be aware of the SQL standards, they will have little 
effect on your job as a SQL programmer. The main benefit of the standards is 
that the basic SQL statements are the same in each dialect. As a result, once 
you’ve learned one dialect, it’s relatively easy to learn another. On the other 
hand, porting applications that use SQL from one type of database to another 
often requires substantial changes. 
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Important events in the history of SQL 


Dr. E. F. Codd developed the relational database model. 


IBM developed the predecessor to SQL, called Structured English Query Language (SEQUEL). 
This language was used on a database system called System/R, but neither the system nor the 
query language was ever released. 


Relational Software, Inc. (later renamed Oracle) released the first relational DBMS, Oracle. 
IBM released their first relational database system, SQL/DS (SQL/Data System). 

IBM released DB2 (Database 2). 

Microsoft released SQL Server. 

The American National Standards Institute (ANSI) published the first set of standards for a 


database query language, called ANSI/ISO SQL-89, or SQL1. These standards were similar to 
IBM’s DB2 SQL dialect. Because they were not stringent standards, most commercial products 
could claim adherence. 


ANSI published revised standards (ANSI/ISO SQL-92, or SQL2) that were more stringent than 
SQL1 and incorporated many new features. These standards introduced levels of compliance, or 
levels of conformance, that indicated the extent to which a dialect met the standards. 

ANSI published SQL3 (ANSI/ISO SQL:1999). These standards incorporated new features, 
including support for objects. Levels of compliance were dropped and were replaced by a core 
specification that defined the essential elements for compliance, plus nine packages. Each 
package is designed to serve a specific market niche. 

ANSI published SQL4 (ANSIASO SQL:2003). These standards introduced XML-related features, 
standardized sequences, and identity columns. 


Description 


e Although SQL is a standard language, each vendor has its own SQL dialect, or variant, 
that may include extensions to the standards. 


How knowing “standard SQL” helps you 
e The most basic SQL statements are the same for all SQL dialects. 
e Once you have learned one SQL dialect, you can easily learn other dialects. 


How knowing “standard SQL” does not help you 


e Any non-trivial application will require modification when moved from one SQL 
database to another. 


Figure 1-7 A brief history of SQL 
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A comparison of Oracle, DB2, and Microsoft SQL 
Server 


Although this book is about Oracle, you may want to know about some of 
the other SQL-based relational database management systems. Figure 1-8 
compares two of the most popular, DB2 and Microsoft (MS) SQL Server, with 
Oracle. 

One of the main differences between the Oracle and SQL Server is that 
Oracle runs under most operating systems, including z/OS, Unix, and Windows. 
In contrast, SQL Server only runs under the Windows operating system, Since 
many developers consider z/OS and Unix to be more stable and secure than 
Windows, most large companies use z/OS or Unix as the operating system for 
the servers that store the databases for mission-critical applications. As a result, 
they can’t use SQL Server and must use Oracle or DB2. 

Oracle has a huge installed base of customers and continues to dominate the 
marketplace, especially for servers running the Unix operating system. Oracle 
works well for large systems and has a reputation for being extremely reliable, 
but also has a reputation for being expensive and difficult to use. 

DB2 was originally designed to run on IBM mainframe systems and contin- 
ues to be the premier database for those systems. It also dominates in hybrid 
environments where IBM mainframes and newer servers must coexist. Although 
it has a reputation for being expensive, it also has a reputation for being reliable 
and easy to use. 

SQL Server is widely used for small- to medium-sized departmental sys- 
tems. It has a reputation for being inexpensive and easy to use. However, it also 
has a reputation for not scaling well for systems with a large number of users. 

Of course, all three of these databases are constantly evolving and compet- 
ing for different segments of the database market. For example, some develop- 
ers say that the latest versions of Oracle and DB2 are easier to use. In addition, 
Oracle and DB2 have both introduced editions of their databases that are less 
expensive. As a result, Oracle and DB2 may begin to take some market share 
away from SQL Server for medium-sized departmental systems. Conversely, 
other developers say that the latest version of SQL Server has improved secu- 
rity, reliability, performance, and scalability. As a result, they claim that SQL 
Server is ready to be used for mission-critical systems with a large number of 
users. 


MySQL and other SQL-based systems 


If you search the Internet, you’ll find that dozens of other relational data- 
base products are also available. These include proprietary systems like 
Informix, Sybase, and Teradata. They also include open source systems like 
MySQL and PostgreSQL, which can be run on most operating systems includ- 
ing Unix and Linux. The source code for open source systems is available to the 
public and can usually be used and redistributed for free. 
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A comparison of Oracle, DB2, and Microsoft SQL Server 
Oracle MS SQL Server 
Release Year 1979 1985 1987 
Platforms Unix OS/390, z/OS Windows 


OS/390, z/OS Unix 
Windows Windows 


Description 


e Oracle is typically used for large, mission-critical, systems that run on one or more Unix 
servers. 


e DB2 is typically used for large, mission-critical systems that run on legacy IBM main- 
frame systems using the z/OS or OS/390 operating system. 


e Microsoft (MS) SQL Server is typically used for small- to medium-sized systems that 
run on one or more Windows servers. 


e MySQL is a popular open-source database that runs on all major operating systems and 
is commonly used for web applications. 


Figure 1-8 A comparison of Oracle, DB2, and Microsoft SQL Server 
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The SQL statements 


In the topics that follow, you’ll learn about some of the SQL statements 
provided by Oracle. As you’ll see, you can use some of these statements to 
manipulate the data in a database, and you can use others to work with database 
objects. Although you may not be able to code these statements after reading 
these topics, you should have a good idea of how they work. Then, you’ll be 
better prepared to learn the details of coding these statements when they’re 
presented in the rest of this book. 


An introduction to the SQL statements 


Figure 1-9 summarizes some of the most common SQL statements. As you 
can see, these statements can be divided into two categories. The statements that 
work with the data in a database are called the data manipulation language 
(DML). These statements are presented in the first group in this figure, and these 
are the statements that application programmers use the most. 

The statements that work with the objects in a database are called the data 
definition language (DDL). On large systems, these statements are used exclu- 
sively by database administrators, or DBAs. It’s the DBA’s job to maintain 
existing databases, tune them for faster performance, and create new databases. 
On smaller systems, though, the SQL programmer may fill the role of the DBA. 
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SQL statements used to work with data (DML) 
Statement Description 


Retrieves data from one or more tables. 
Adds new rows to a table. 

Changes existing rows in a table. 
Deletes existing rows from a table. 


SQL statements used to work with database objects (DDL) 
CREATE USER Creates a new user for a database. 
CREATE TABLE Creates a new table in a database. 
CREATE SEQUENCE Creates a new sequence that automatically generates numbers. 
CREATE INDEX Creates a new index for a table. 


ALTER USER Changes the definition of an existing user. 
ALTER TABLE Changes the definition of an existing table. 
ALTER SEQUENCE Changes the definition of an existing sequence. 
ALTER INDEX Changes the structure of an existing index. 


DROP USER Deletes an existing user. 
DROP TABLE Deletes an existing table. 
DROP SEQUENCE Deletes an existing sequence. 
DROP INDEX Deletes an existing index. 


GRANT Grants privileges to a user. 
REVOKE Revokes privileges from a user. 


Description 


e The SQL statements can be divided into two categories: the data manipulation language 
(DML) that lets you work with the data in the database and the data definition language 
(DDL) that lets you work with the objects in the database. 

e SQL programmers typically work with the DML statements, while database administra- 
tors (DBAs) use the DDL statements. 


Figure 1-9 An introduction to the SQL statements 
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How to work with database objects 


To give you an idea of how you use the DDL statements shown in the 
previous figure, figure 1-10 presents some examples. Here, the first statement 
creates a user named AP, Then, the second statement grants all privileges to this 
user. As a result, this user can create and delete the tables and other database 
objects for the AP database that’s used in many of the examples throughout this 
book. In addition, this user can select, insert, update, and delete data that’s 
stored in these tables. 

The third example creates the Invoices table that’s used throughout this 
chapter. If you don’t understand all of this code right now, don’t worry. You’ll 
learn how to code statements like this in chapter 10. For now, just realize that 
this statement defines each column in the table, including its data type, whether 
or not it allows null values, and its default value if it has one. 

In addition, the third example defines the primary and foreign key columns 
for the table. These definitions are one type of constraint. Since the Invoices 
table includes foreign keys to the Vendors and Terms tables, these tables must 
be created before the Invoices table. Conversely, before you can delete the 
Vendors and Terms tables, you must delete the Invoices table. 

The fourth statement in this figure changes the Invoices table by adding a 
column to it. Like the statement that created the table, this statement specifies 
all the attributes of the new column. Then, the fifth statement deletes the column 
that was just added. 

The sixth statement creates an index on the Invoices table. In this case, the 
index is for the vendor_id column, which is used frequently to access the table. 
Then, the seventh statement deletes the index that was just added. 

The last statement creates a sequence that can be used to automatically 
generate a value for the invoice_id column of the Invoices table. In particular, 
this sequence begins with a value of 115 and increments the sequence by 1 each 
time the sequence is used. In figure 1-13, you can see how a sequence can be 
used when inserting a row into the Invoices table. 
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A statement that creates a new user for a database 
CREATE USER ap IDENTIFIED BY ap 


A statement that grants privileges to a user 
GRANT ALL PRIVILEGES TO ap 


A statement that creates a new table 


CREATE TABLE invoices 
{ 


invoice id NUMBER NOT NULL, 

vendor_id NUMBER NOT NULL, 

invoice number VARCHAR2 (50) NOT NULL, 

invoice_date DATE NOT NULL, 

invoice total NUMBER (9, 2) NOT NULL, 

payment total NUMBER (9, 2) DEFAULT 0, 
credit total NUMBER (9, 2) DEFAULT 0, 
terms_id NUMBER NOT NULL, 

invoice due date DATE NOT NULL, 

payment date DATE, 


CONSTRAINT invoices pk 
PRIMARY KEY (invoice id), 

CONSTRAINT invoices fk vendors 
FOREIGN KEY (vendor_id) 
REFERENCES vendors (vendor id), 

CONSTRAINT invoices_fk_terms 
FOREIGN KEY (terms id) 
REFERENCES terms (terms_id) 

) 


A statement that adds a new column to a table 


ALTER TABLE invoices 
ADD balance _due NUMBER (9,2) 


A statement that deletes the new column 


ALTER TABLE invoices 
DROP COLUMN balance due 


A statement that creates an index on the table 


CREATE INDEX invoices _vendor_id index 
ON invoices (vendor_id) 


A statement that deletes the new index 


DROP INDEX invoices _vendor_id_index 


A statement that creates a sequence for generating invoice_id values 


CREATE SEQUENCE invoice id seq 
START WITH 115 
INCREMENT BY 1 


Figure 1-10 Typical statements for working with database objects 
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How to query a single table 


Figure 1-11 shows how to use a SELECT statement to query a single table 
in a database. To start, this figure shows some of the columns and rows of the 
Invoices table. Then, in the SELECT statement that follows, the SELECT clause 
names the columns to be retrieved, and the FROM clause names the table that 
contains the columns, called the base table. In this case, six columns will be 
retrieved from the Invoices table. 

Note that the last column, balance_due, is calculated from three other 
columns in the table. In other words, a column by the name of balance_due 
doesn’t actually exist in the database. This type of column is called a calculated 
value, and it exists only in the results of the query. 

In addition to the SELECT and FROM clauses, this SELECT statement 
includes a WHERE clause and an ORDER BY clause. The WHERE clause 
gives the criteria for the rows to be selected. In this case, a row is selected only 
if it has a balance due that’s greater than zero. Finally, the returned rows are 
sorted by the invoice_date column. 

This figure also shows the result set (or result table) that’s returned by the 
SELECT statement. A result set is a logical table that’s created temporarily 
within the database. When an application requests data from a database, it 
receives a result set. 
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The Invoices base table 


P ivoicep | venvor tp | invoice_numBer | invoice_pate | invoice_totac | payMent_totac | crepit_totat |} TERMS D 

1 1 34 QP58872 25-FEB-08 116.54 118.54 0 MES 

2 2 34 0545443 14-MAR-08 1083.58 1083.58 o al” 
3 3 110 P-0608 11-APR-08 2055118 0 1200 5 
4 4 110 P-0259 18-APR-08 26881 .4 256881.4 0 3 
5 5 81 MABO1489 16-APR-08 936.93 936.93 o 3 
8 6 122 989319-497 17-APR-08 2312.2 o o 4 
7 7 82 C73-24 17-APR-08 s00 B00 o 2 
8 8 122 989319-487 16-APR-08 1927.54 o o 4 
3 3 122 989319-477 19-APR-08 2184.11 2184.11 o 4 
10 10 122 98931 9-467 24-APR-08 2318.03 2318.03 o 4 
11 11 122 989319-457 24-APR-08 3813.33 3813.33 0 3 
12 12 122 989319-447 24-APR-08 3669.99 3669.99 o 3 
13 13 122 989319-437 24-APR-08 2785.38 2765.36 o 2 
14 14 122 989319-427 25-APR-08 2115.81 2115.81 o 1 
121 97/553B 28-APR-08 0 4 


A SELECT statement that retrieves and sorts selected columns and rows 
from the Invoices table 


SELECT invoice number, invoice date, invoice total, 
payment total, credit total, 
invoice_total - payment _total - credit_total AS balance due 
FROM invoices 
WHERE invoice total - payment_total - credit total > 0 
ORDER BY invoice date 


The result set defined by the SELECT statement 


INVOICE_NUMBER | InvoIcE_paTe | invoice_totaL A payment_totat |} creDIT_toTaL |) BALANCE DUE 
1 P-0608 11-APR-08 20551.18 o 1200 19351.18 
2 989319-497 17-APR-08 2312.2 o o 2312.2 
3 989319-487 18-APR-08 1927.54 o o 1927.54 
4 975538 26-APR-08 513.55 o o 313.55 
5 97553 27-APR-08 904.14 o o 904.14 
6 97522 30-APR-08 1962.13 o 200 176213 


Concepts 


e You use the SELECT statement to retrieve selected columns and rows from a base table. 
The result of a SELECT statement is a result table, or result set, like the one shown 
above. 


e A result set can include calculated values that are calculated from columns in the table. 
e A SELECT statement is commonly referred to as a query. 


Figure 1-11 How to query a single table 
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How to join data from two or more tables 


Figure 1-12 presents a SELECT statement that retrieves data from two 
tables. This type of operation is called a join because the data from the twọ 
tables is joined together into a single result set. For example, the SELECT 
statement in this figure joins data from the Invoices and Vendors tables. 

An inner join is the most common type of join. When you use an inner join, 
rows from the two tables in the join are included in the result table only if their 
related columns match. These matching columns are specified in the FROM 
clause of the SELECT statement. In the SELECT statement in this figure, for 
example, rows from the Invoices and Vendors tables are included only if the 
value of the vendor_id column in the Vendors table matches the value of the 
vendor_id column in one or more rows in the Invoices table, If there aren’t any 
invoices for a particular vendor, that vendor won’t be included in the result set. 

Although this figure shows only how to join data from two tables, you can 
extend this syntax to join data from three or more tables. If, for example, you 
want to include line item data from a table named Invoice_Line_Items in the 
results shown in this figure, you can code the FROM clause of the SELECT 
statement like this: 


FROM vendors 
INNER JOIN invoices 
ON vendors.vendor id = involces.vendor id 
INNER JOIN invoice line items 
ON invoices.invoice id = invoice line items.invoice id 
Then, in the SELECT clause, you can include any of the columns in the 
Inyvoice_Line_Items table. 

In addition to inner joins, most relational databases including Oracle 
support other types of joins such as outer joins. An outer join lets you include 
all rows from a table even if the other table doesn’t have a matching row. You'll 
learn more about the different types of joins in chapter 4. 


Chapter 1 An introduction to relational databases and SQL 


A SELECT statement that joins data from the Vendors and Invoices tables 


SELECT vendor name, invoice number, invoice date, invoice total 
FROM vendors INNER JOIN invoices 
ON vendors.vendor id = invoices.vendor id 
WHERE invoice_total >= 500 
ORDER BY vendor name, invoice total DESC 


The result set defined by the SELECT statement 


VENDOR_NAME INVOICE_NUMBER |) INvoIcE_paTE A INVOICE_TOTAL 

7 Federal Express Corporation 963253230 415-MAY-08 739.2 
6 Ford Motor Credit Company 9982771 03-JUN-06 503.2 
9 Franchise Tax Board RTR-?2-3662-% 04-JUN-08 1600 
10 Fresno County Tax Collector PO2-88D77S7 06-JUN-08 856.92 
11 IBM 545443 14-MA4R-08 1083.58 
12 Ingram 31359783 23-MA4¥-08 1575 


13 Ingram 31361833 23-M4Y-08 579.42 
14 Malloy Lithographing Inc 0-2058 08-MAY-08 37966.19 
15 Malloy Lithographing Inc P.0259 16-A4PR-08 268814 
16 Malloy Lithographing Inc 0-2060 08-MAY -08 23517.58 
17 Malloy Lithographing Inc P-0608 11-4PR-08 20551.18 
18 Malloy Lithographing Inc 0-2436 07-MAY-08 10976.08 
19 Pollstar 77290 04-JUN-08 1750 


Concepts 
e A join lets you combine data from two or more tables into a single result set. 


e The most common type of join is an inner join. This type of join returns rows from both 
tables only if their related columns match. 


e An outer join returns rows from one table in the join even if the other table doesn’t 
contain a matching row. 


Figure 1-12 How to join data from two or more tables 
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How to add, update, and delete data in a table 


Figure 1-13 shows how you can use the INSERT, UPDATE, and DELETE 
statements to modify the data in a table. In this figure, for example, the first 
statement uses the INSERT statement to add a row to the Invoices table. To do 
that, the INSERT clause names the columns whose values are supplied in the 
VALUES clause. 

In chapter 7, you’ll learn more about specifying column names and values. 
For now, just note that you have to specify a value for a column unless it’s a 
column that allows null values or a column that’s defined with a default value. 

When working with primary keys, you often use the NEXTVAL pseudo 
column of a sequence to get the next value in a sequence of numbers that are 
automatically generated. In this figure, for example, the NEXTVAL pseudo 
column of the sequence named invoice_id_seq is used to get the value for the 
invoice_id column. To review how this sequence is defined, you can refer back 
to figure 1-10. 

The two UPDATE statements in this figure show how to change the data in 
one or more rows of a table. The first statement, for example, assigns a value of 
35.89 to the credit_total column of the invoice in the Invoices table with invoice 
number 367447. The second statement adds 30 days to the invoice due date for 
each row in the Invoices table whose terms_id column has a value of 4. 

To delete rows from a table, you use the DELETE statement. For example, 
the first DELETE statement in this figure deletes the invoice with invoice 
number 4-342-8069 from the Invoices table. The second DELETE statement 
deletes all invoices with a balance due of zero. However, since the Invoices 
table has a foreign key that references the Invoice_Line_Items table, these 
DELETE statements won’t work unless the invoice doesn’t contain any line 
items. As a result, to get these delete statements to work, you would need to 
delete the corresponding rows from the Invoice_Line_Items table first. 
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A statement that adds a row to the Invoices table 


INSERT INTO invoices 
{(invoice_id, vendor_id, invoice_number, invoice_date, 
invoice total, terms id, invoice due date) 

VALUES 
{invoice id_seq.NEXTVAL, 12, '3289175', '18-JUL-08', 
165, 3, '17-AUG-08') 


A statement that changes the value of the credit_total column for a 
selected row in the Invoices table 


UPDATE invoices 
SET credit total = 35.89 
WHERE invoice_number = '367447!' 


A statement that changes the values in invoice_due_date column for all 
invoices with the specified terms_id 


UPDATE invoices 
SET invoice due date = invoice due date + 30 
WHERE terms_id = 4 


A statement that deletes a selected invoice from the Invoices table 


DELETE FROM invoices 
WHERE invoice_number = '4-342-8069' 


A statement that deletes all paid invoices from the Invoices table 
DELETE FROM invoices 


WHERE invoice_total - payment _total - credit _total = 0 
Concepts 
e You use the INSERT statement to add rows to a table. 


e ‘You use the UPDATE statement to change the values in one or more rows of a table 
based on the condition you specify. 


e You use the DELETE statement to delete one or more rows from a table based on the 
condition you specify. 


e An INSERT, UPDATE, or DELETE statement can be referred to as an action query. 


Warning 


e If you’re new to SQL statements, please don’t execute the statements above until you 
read chapter 7 and understand the effect that these statements can have on the database. 


Figure 1-13 How to add, update, and delete data in a table 
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How to work with views 


A view is a database object that’s similar to a table. However, a view is a 
predefined query that provides another way of viewing the data that’s stored in a 
table. To create a view, you use the CREATE VIEW statement as shown in 
figure 1-14. This statement causes the SELECT statement you specify to be 
stored with the database. In this case, the CREATE VIEW statement creates a 
view named vendors_min that retrieves three columns from the Vendors table. 

To access the view, you issue a SELECT statement that refers to the view. 
This causes a virtual table to be created from the SELECT statement in the 
view. Then, the SELECT statement that referred to the view is executed on this 
virtual table to create the result set. 

You can use views to restrict the data that a user is allowed to access or to 
present data in a form that’s easier for the user to understand. In some data- 
bases, users may be allowed to access data only through views. 
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A CREATE VIEW statement for a view 


CREATE VIEW vendors min AS 
SELECT vendor_name, vendor_state, vendor_phone 
FROM vendors 


The virtual table for the view 


VENDOR_NAME YVENDOR_STATE VENDOR_PHONE 
1 US Postal Service (800) 555-1205 
2 National Information Data Ctr (301) 555-8950 
3 Register of Copyrights NULL 


4 Jobtrak (800) 555-8725 
5 Nevbrige Book Clubs (800) 555-9980 
6 California Chamber Of Commerce CA (916) 555-6670 
7 Towne Advertiser's Malling Svcs CA NULL 


A SELECT statement that uses the view 


SELECT * 

FROM vendors min 

WHERE vendor state = 'CA' 
ORDER BY vendor_name 


The result set 


VENDOR_NAME VENDOR_STATE VENDOR_PHONE 
1 ASC Signs NULL 
2 Abbey Office Furnishings (559) 555-8300 
3 American Express (800) 555-3344 


4 Aztek Label (714) 555-9000 
5 BFI Industries (559) 555-1551 
6 Bertelsmann Industry Syes. Inc (805) 555-0564 
7 Bill Jones NULL 


Description 
e A view consists of a SELECT statement that’s stored with the database. 
e When you use a view, a virtual table is created on the server that represents the view. 


Figure 1-14 How to work with views 
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SQL coding guidelines 


SQL is a freeform language. That means that you can include line breaks, 
spaces, and indentation without affecting the way the database interprets the 
code. In addition, SQL isn’t case-sensitive like some languages. That means that 
you can use uppercase or lowercase letters or a combination of the two without 
affecting the way the database interprets the code. 

Although you can code SQL statements with a freeform style, we suggest 
that you follow the coding recommendations presented in figure 1-15. The 
examples in this figure illustrate the value of these coding recommendations. 
The first example presents an unformatted SELECT statement that’s difficult to 
read. In contrast, this statement is much easier to read after our coding recom- 
mendations are applied as shown in the second example. 

The third example illustrates how to code a block comment. This type of 
comment is typically coded at the beginning of a group of statements and is 
used to document the entire group. Block comments can also be used within a 
statement to describe blocks of code, but that’s not common. 

The fourth example in this figure includes a single-line comment. This type 
of comment is typically used to document a single line of code. A single-line 
comment can be coded on a separate line as shown in this example, or it can be 
coded at the end of a line of code. In either case, the comment is delimited by 
the end of the line. 

Although many programmers sprinkle their code with comments, that 
shouldn’t be necessary if you write your code so it’s easy to read and under- 
stand. Instead, you should use comments only to clarify sections of code that 
are difficult to understand. Then, if you change the code, you should be sure to 
change the comments too. Otherwise, the comments won’t accurately represent 
what the code does, which will make the code even more difficult to understand. 
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A SELECT statement that’s difficult to read 


select invoice number, invoice date, invoice total, payment total, 
credit_total, invoice_total - payment_total - credit_total as balance due 
from invoices where invoice total - payment total - credit total > 0 order 
by invoice_date 


A SELECT statement that’s coded with a readable style 


SELECT invoice_number, invoice_date, invoice total, 
payment total, credit total, 
inveice_total - payment_total - credit_total AS balance due 
FROM invoices 
WHERE invoice _total - payment_total - credit total > 0 
ORDER BY invoice date 


SELECT statement with a block comment 
/* 
Author: Joel Murach 
Date: 8/22/2008 
*/ 
SELECT invoice number, invoice date, invoice total, 
invoice_total - payment_total - credit_total AS balance due 
FROM invoices 


A SELECT statement with a single-line comment 


-- The fourth column calculates the balance due 
SELECT invoice number, invoice_date, invoice total, 

invoice _total - payment _total - credit_total AS balance due 
FROM invoices 


Coding recommendations 

e Capitalize all keywords, and use lowercase for the other code in a SQL statement. 
e Separate the words in names with underscores, as in invoice_number. 

e Start each clause on a new line. 

e Break long clauses into multiple lines and indent continued lines. 


e Use comments only for portions of code that are difficult to understand. Then, make sure 
that the comments are correct and up-to-date. 


How to code a comment 
e To code a block comment, type /* at the start of the block and */ at the end. 
e To code a single-line comment, type -- followed by the comment. 


Description 


e Line breaks, white space, indentation, and capitalization have no effect on the operation 
of a statement. 


e Comments can be used to document what a statement does or what specific parts of a 
statement do. They are not executed by the system. 


Figure 1-15 SQL coding guidelines 
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An introduction to PL/SQL 


SQL isn’t a procedural language like Java or other procedural languages. 
However, Oracle provides an extension to standard SQL known as PL/SQL 
(Procedural Language/SQL) that allows you to write procedural code that 
includes if/else statements, loops, and error handling. This topic gives you a 
quick overview of how PL/SQL works. Then, you can learn the details of 
working with PL/SQL in section 4 of this book. 


How to work with stored procedures 


A stored procedure is a set of one or more SQL or PL/SQL statements that 
are stored together in a database. To create a stored procedure, you use the 
CREATE PROCEDURE statement as shown in figure 1-16. In this figure, the 
stored procedure begins by defining two parameters. The first is named 
invoice_number_param and is of the VARCHAR2 type. The second is named 
credit_total_param and is of the NUMBER data type. Note that these data types 
correspond with the data types that are used for the invoice_number and 
credit_total columns of the Invoices table. 

The body of this stored procedure contains two statements. The first state- 
ment is an UPDATE statement that updates the credit_total column of the 
Invoices table. The second statement is a COMMIT statement that makes the 
update permanent. 

If the body of the stored procedure encounters an error, the EXCEPTION 
block is executed. In that case, the ROLLBACK statement rolls back any 
changes that the stored procedure has made to the database so far. 

When you code the statements in a stored procedure, you must code a 
semicolon at the end of each statement. In this figure, for example, a semicolon 
is coded after the UPDATE statement, the COMMIT statement, and the ROLL- 
BACK statement. A semicolon is also coded after the END statement that ends 
the stored procedure. Finally, you must code a front slash (/) at the end of a 
stored procedure. 

To use the stored procedure, you send a request for it to be executed. One 
way to do that is to use the CALL statement shown in this figure. You can also 
execute a stored procedure from an application program by issuing an appropri- 
ate statement. How you do that depends on the programming language and the 
API you’re using to access the database. Either way, when the server receives 
the request, it executes the stored procedure. 
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A CREATE PROCEDURE statement 


CREATE OR REPLACE PROCEDURE update_invoices credit total 
( 
invoice number param VARCHAR2, 
credit_total param NUMBER 
) 
AS 
BEGIN 
UPDATE invoices 
SET credit total = credit total param 
WHERE invoice number = invoice number param; 


COMMIT; 
EXCEPTION 
WHEN OTHERS THEN 
ROLLBACK; 
END; 
/ 


A statement that executes the stored procedure 


CALL update invoices credit_total ('367447', 35.89) 


Concepts 


PL/SQL (Procedural Language/SQL) is an Oracle extension to standard SQL that allows 
you to write procedural code that includes if/else statements, loops, and error handling. 


A stored procedure is a type of subprogram that contains one or more SQL or PL/SQL 
statements that have been compiled and stored with the database. 


Figure 1-16 How to work with stored procedures 
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How to work with user-defined functions 


A user-defined function (UDF) is similar to a stored procedure in many 
ways. To start, the syntax for creating a user-defined function is similar to the 
syntax for creating a stored procedure. Like a stored procedure, a user-defined 
function can accept several parameters. In addition, a function returns a value. 

However, unlike a stored procedure, a UDF can’t modify the data that’s 
stored in a table. Instead, it performs some processing and returns a value. User- 
defined functions work similarly to the built-in functions provided by Oracle 
that are described in chapter 8, but they allow programmers to create custom 
functions for a specific database. 

To create a function, you use the CREATE FUNCTION statement as shown 
in figure 1-17. This function begins by defining an input parameter named 
vendor_id_param of the INTEGER type. Then, it defines a variable named 
avg_invoice_total_var of the NUMBER type with nine digits and two decimal 
positions. 

Within the body of the user-defined function, the SELECT statement uses 
the built-in AVG function to find the average of all invoice totals for each 
vendor and store it in the variable. Then the RETURN statement returns the 
variable. 

To use a function, you can code the name of the function in a SELECT 
statement followed by a set of parentheses. Within the parentheses, you can 
code the parameters accepted by the function, separating multiple parameters 
with a comma. In this figure, the avg_invoice_total function only accepts a 
single parameter, the vendor’s ID. As a result, you can code the vendor_id 
column within the parentheses that come after the function’s name. Then, this 
column will return the average invoice amount for each vendor. 


How to work with triggers 


A trigger is a special type of stored procedure that’s executed automatically 
when an insert, update, or delete operation is executed on a table. Triggers are 
used most often to validate data before a row is added or updated, but they can 
also be used to maintain the relationships between tables. 
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A CREATE FUNCTION statement 


CREATE OR REPLACE FUNCTION avg_invoice total 
( 
vendor id param INTEGER 
) 
RETURN NUMBER 
AS 
avg_invoice total_var NUMBER(9, 2); 
BEGIN 
SELECT AvG(invoice total) 
INTO avg_invoice total _var 
FROM invoices 
WHERE vendor_id = vendor_id param; 


RETURN avg_ invoice total var; 
END; 
/ 


A statement that uses the function 


SELECT vendor_id, invoice _total, avg _invoice_total (vendor_id) 
FROM invoices 
ORDER BY vendor_id 


The result set 


VENDOR_ID INYOICE_TOTAL A¥G_INYOICE_TOTALCYENDOR_ID) 
1083.58 
116.54 

224 188 


224 188 
116 188 
856.92 

10963.66 


Concepts 


e A user-defined function (UDF) is a type of subprogram that contains one or more SQL 
or PL/SQL statements that have been compiled and stored with the database. 


Figure 1-17 How to work with user-defined functions 
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How to use SQL from an application 
program 


This book teaches you how to use SQL from within the Oracle environment. 
As you learned in the last chapter, however, SQL is commonly used from 
application programs too. So in the topics that follow, you’ll get a general idea 
of how that works. And you’ll see, it’s easy to recognize the SQL statements in 
an application program because they’re coded just as they would be if they were 
running on their own. 


Common data access models 


Figure 1-18 shows three common ways for an application to access an 
Oracle database. To access an Oracle database from a Java application, for 
example, you can use JDBC (Java Database Connectivity). This data access 
model requires a driver to communicate with Oracle. 

Since Java is commonly used to work with Oracle databases, this figure lists 
two JDBC drivers that are provided by a default installation of Oracle and are 
commonly used by Java applications. To start, the thin driver is commonly used 
by Java applications that run on a client. These types of applications include 
standalone Java applications and applets. On the other hand, the OCI driver is 
commonly used for applications that run on a server such as Java web applica- 
tions that use servlets and JSP. Although the OCI driver requires more system 
resources than the thin driver, it usually runs faster, and it provides for addi- 
tional features such as connection pooling. 

To access an Oracle database from an application written in a .NET lan- 
guage such as Visual Basic or C#, you can use Microsoft’s newest data access 
model, ADO.NET. Like JDBC, this data access model requires a driver to 
communicate with Oracle. 

To access Oracle from a Visual Basic 6 or Access application, you can use a 
data access model called ADO. This data access model was the predecessor to 
ADO.NET, and it used a protocol known as OLE DB to communicate with the 
database. 
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Common options for accessing Oracle data 


=) Le = 


Two commonly used Oracle JDBC drivers 
Description 


Thin driver Requires fewer system resources than the OCI driver. This driver is written entirely in Java 
and is commonly used for applications that run on a client. 


OCI driver Requires more system resources than the thin driver. However, it runs faster and is commonly 
used for applications that run on a server. 


Description 


e To work with the data in an Oracle database, an application uses a data access model. 
For a Java application, that model is typically JDBC (Java Database Connectivity). For a 
.NET application written in Visual Basic or C#, that model is typically ADO.NET. For an 
Access or Visual Basic 6 application, that model is typically ADO (ActiveX Data Ob- 
jects). 

e Each data access model defines a set of objects you can use to connect to and work with 
an Oracle database. For example, each of the three models shown above includes a 
Connection object that you can use to specify the information for connecting to a data- 
base. 


e The data access models require additional software, called drivers, to communicate with 
an Oracle database. For example, JDBC requires one of the drivers described above. 


Figure 1-18 | Common data access models 
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Java code that retrieves data from an Oracle 
database 


Figure 1-19 presents Java code that uses the JDBC to execute a SQL 
statement against an Oracle database. This code is from a simple application 
that retrieves and displays information from the Vendors and Invoices tables. It 
creates the JDBC objects used by the application and then uses them to display 
the data that’s retrieved. 

If you have some Java programming experience, you shouldn’t have much 
trouble understanding this code. If you don’t have Java experience, that’s fine 
too. In that case, focus on how this code uses a data access API to execute SQL 
against an Oracle database. If you want to learn more about using Java to work 
with a database, we recommend Murach’s Java SE 6 and Murach’s Java 
Servlets and JSP. 

Before you attempt to execute this code, you need to add a JAR file that 
contains the Oracle JDBC driver to your classpath. That way, Java will be able 
to find the files for the driver. The easiest way to do that is to copy the JAR file 
that contains the drivers into the JDK’s jre/lib/ext directory. For example, the 
ojdbc14.jar file contains classes for working with JDK 1.4 and 1.5. By default, 
this JAR file is installed in the directory shown in this figure when you install 
Oracle 10g Express Edition. 

For more information about Oracle’s JDBC drivers, you can open the 
readme.txt file that’s shown in this figure. This file contains information about 
the drivers that are specific to the version of Oracle that’s installed on your 
system. 

The code in this figure begins by importing all classes in the java.sql 
package. These classes define JDBC objects like the Connection object that are 
used to access an Oracle database. 

Within the main method, the first block of code uses the forName method of 
the Class class to load the Oracle JDBC driver. This block of code is needed if 
you're using Oracle 10g because its database driver uses JDBC 3.0 (which is 
part of JDK 1.5). If you’re using Oracle 11g, though, you will usually want to 
omit this block of code because the 11g driver supports JDBC 4.0 (which is part 
of JDK 1.6), and JDBC 4,0 can automatically find the driver. 

After the driver has been loaded, this code defines the Connection, Com- 
mand, and ResultSet objects that are used by the application and sets them equal 
to null values. Within the try block, the first four statements create a Connection 
object. To do that, the first statement specifies a database URL that uses the thin 
driver to connect to the Express Edition (XE) of Oracle that’s running on port 
1521 on the same computer as the Java application. Then, the second and third 
statements specify the username and password for the AP user. Finally, the 
fourth statement uses the getConnection method of the DriverManager class to 
return a Connection object. 

Once the Connection object has been created, the next three statements 
execute a SELECT statement. Here, the first statement creates a Statement 
object. Then, the second statement defines the SELECT statement. Finally, the 
third statement executes the query and stores the result in the ResultSet object. 
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The readme file that describes the JDBC drivers for Oracle 
C:\oraclexe\app\oracle\product\10.2.0\server\jdbc\readme. txt 


The JAR file that contains the JDBC driver for Oracle 10g Express 
C:\oraclexe\app\oracle\product\10.2.0\server\jdbe\1ib\ojdbcl4.jar 


Java code that retrieves data from an Oracle database 


import java.sql.*; 
import java.text.NumberFormat; 


public class DBTestApp 


{ 


public static void main(String args[]) 
{ 
// Load the database driver 
// NOTE: This block is necessary for Oracle 10g (JDBC 3.0), 
// but not for Oracle llg (JDBC 4.0) 
try 
{ 


Class. forName ("oracle.jdbc.OracleDriver") ; 


catch (ClassNotFoundException e) 


{ 
} 


// define common JDBC objects 
Connection connection = null; 
Statement statement = null; 
ResultSet rs = null; 


e.printStackTrace(); 


try 
{ 
// Connect to the database 
String dbUrl = "jdbc:oracle: thin: @localhost:1521:XE"; 
String username = "ap"; 
String password = "ap"; 


connection = DriverManager.getConnection ( 
dbUrl, username, password); 


// Execute a SELECT statement 

statement = connection.createsStatement (); 

String query = 
"SELECT vendor name, invoice number, invoice total " + 
"FROM vendors INNER JOIN invoices " + 
= ON vendors.vendor_id = invoices.vendor_id " + 
"WHERE invoice total >= 500 "+ 
"ORDER BY vendor name, invoice total DESC"; 

re = statement.executeQuery (query) ; 


Figure 1-19 Java code that retrieves data from an Oracle database (part 1 of 2) 
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After the result of the SELECT statement has been stored in the ResultSet 
object, a while loop is used to process the values that are stored in the ResultSet 
object. To do that, the getXxxx methods of the ResultSet object are used to 
retrieve the values that are stored in the vendor_name, invoice_number, and 
invoice_total columns. Here, the getString method is used to get the 
VARCHAR? type and the getDouble method is used to get the NUMBER type. 
Finally, the NumberFormat class is used to apply currency formatting to the 
invoice_total column, and the values are printed to the console. 

If this code encounters an error, the catch block will be executed. In this 
figure, that block of code prints information about the error to the console. But 
whether or not this code executes successfully, the finally block will close all of 
the resources that were used to execute the SELECT statement and process its 
results. 

Now that you’ve reviewed this code, you can also see that only one state- 
ment in this figure uses SQL. That’s the statement that specifies the SELECT 
statement to be executed. Of course, if an application updates data, it can 
execute INSERT, UPDATE, and DELETE statements as well. With the skills 
that you’ll learn in this book, though, you won’t have any trouble coding the 
SQL statements you need. 

However, you can also see that there’s a lot involved in accessing an Oracle 
database from an application program. That’s why most application program- 
mers use a framework that makes it easier to execute SQL statements against a 
database. In some cases, application programmers write their own utility classes 
or data access classes. In other cases, application programmers use an existing 
framework such as Hibernate. 
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Java code that retrieves data from an Oracle database Page 2 


// Display the results of a SELECT statement 
System.out.println("Invoices with totals over 500:\n"); 
while (rs.next ()) 


String vendorName = rs.getString("vendor name"); 
String invoiceNumber = rs.getString("invoice_number") ; 
double invoiceTotal = rs.getDouble("invoice total") ; 


NumberFormat currency = NumberFormat.getCurrencyInstance() ; 
String invoiceTotalString = currency. format (invoiceTotal) ; 


System.out.printlin( 
"Vendor: " + vendorName + "\n" + 
"Invoice No: " + invoiceNumber + "\n" + 
"Total: " + invoiceTotalString + "\n")); 


} 


catch (SQLException e) 
e.printStackTrace()}; 


finally 
try 


if (rs l= null) 
ra.close(); 

if (statement l= null) 
atatement.close({); 

if (connection l= null) 
connection.close({); 


catch (SQLException e) 


@.printStackTrace(); 


} 


Description 


e Before you can use Java to work with Oracle, you must make a JAR file that contains a 
JDBC driver available to your system. The easiest way to do that is to copy the JAR into 
the JDK’s jre/lib/ext directory. 


e The ojdbc14.jar file contains classes for use with JDK 1.4 and 1.5 including the JDBC 
driver classes. 


e To execute a SQL statement from a Java application, you can use JDBC objects like the 
Connection, Statement, and ResultSet objects. 


Figure 1-19 Java code that retrieves data from an Oracle database (part 2 of 2) 
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Perspective 


To help you understand how SQL is used from an application program, this 
chapter has introduced you to the hardware and software components of a client/ 
server system. It has also described how relational databases are organized and 
how you use some of the SQL statements to work with the data in a relational 
database. With that as background, you’re now ready to start using Oracle. In the 
next chapter, then, you’ll learn how to use some of the tools for working with an 
Oracle database. 

Although this book shows how to work with a relational database, another type 
of database that has come into use recently is the object database. This type of 
database is designed to store and retrieve the objects that are used by applications 
written in an object-oriented programming language such as Java or C++. Al- 
though object databases have some advantages over relational databases, they also 
have some disadvantages. In general, object databases haven’t yet become widely 
used, but they have acquired a niche in some areas such as engineering, telecom- 
munications, financial services, high energy physics, and molecular biology. 

If you read more about Oracle’s advanced features, you’ll see that Oracle 
provides some capabilities for working with objects that allow it to work like an 
object database. For example, Oracle allows you to define custom data types such 
as an Invoice data type. Then, you can store an Invoice object within a database 
without having to parse the object so you can store its values within multiple 
columns of a row. However, since these features haven’t yet come into widespread 
use, they aren’t covered in this book. 
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Terms 


client 

server 

database server 

network 

client/server system 

local area network (LAN) 
enterprise system 

wide area network (WAN) 
database management system (DBMS) 
back end 

application software 

application programming interface (API) 
data access API 

JDBC (Java Database Connectivity) 
front end 

SQL (Structured Query Language) 
query 

query results 

application server 

web server 

business component 

web application 

web service 

web browser 

relational database 

table 

row 

column 

record 

field 

cell 

value 

primary key 

composite primary key 
non-primary key 
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unique key 

index 

foreign key 

one-to-many relationship 
one-to-one relationship 
many-to-many relationship 
data type 

null value 

null 

default value 

SQL dialect 

SQL extension 

Open source system 

data manipulation language (DML) 
data definition language (DDL) 
database administrator (DBA) 
constraint 

sequence 

base table 

result table 

result set 

calculated value 

join 

inner join 

outer join 

action query 

view 

virtual table 

comment 

block comment 

single-line comment 

PL/SQL (Procedural Language/SQL) 
stored procedure 

user-defined function (UDF) 
trigger 
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How to use 
Oracle SQL Developer 
and other tools 


In the last chapter, you learned about some of the SQL statements that you can 
use to work with the data in a relational database. Before you learn the details 
of coding these statements, however, you need to learn how to work with an 
Oracle database, how to use Oracle SQL Developer to enter and execute SQL 
statements, and how to use the SQL Reference manual. 


How to work with an Oracle database .........sssssssrsssrssses 48 
How to start and stop the database service .....ccsssscssssescssesssseecscescaceseesnceseans 48 
How to use the Database Home Page... sidiiscssssssssosssanssscessersasorsnsshagnionsncvesnisne 50 
Howto irs ES PIES viaria aaan iiaiai 52 
How to use SQL Developer to work with a database ........ 54 
How to'createraidatabase connection 0.) cisisiesseccsacinessniioiavaissansanssasustsprabaaianie 54 
How to export or import database connections ......scsccssessceeeseescercereeneerenee 54 
How to navigate through the database Objects ......cccscssessercceneseescseserseerenee 56 
How to view the column definitions for a table .........cssssssssessscssersersersers 58 
Howatoiview: the datatfora table niiin ener ra sinoati 58 
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How to work with an Oracle database 


Since Oracle Database 11g Express Edition has not been released as of 
press time for this book, this topic uses Oracle Database 10g Express Edition to 
illustrate the skills for working with the Oracle Database. However, when the 
Express Edition of 11g becomes available, you should be able to use similar 
techniques to work with that version of the Oracle Database. 


How to start and stop the database service 


If you installed the Express Edition of the Oracle Database on your com- 
puter as described in appendix A, the database service starts automatically 
when you start your computer. This piece of software is often referred to as the 
database server, or the database engine. It receives SQL statements that are 
passed to it, processes them, and returns the results. 

The database listener also starts automatically when you start your com- 
puter. This piece of software listens for requests from remote clients and returns 
the results to them. 

From time to time, however, you may want to stop the database. If, for 
example, you aren’t going to be using the database and you want to free the 
resources on your computer, you can stop the database. Or, if the port that is 
being used by the Oracle Database conflicts with another program, you can stop 
the database. Then, when you want to work with the database again, you can 
start it. 

The easiest way to stop the database service is to use the Stop Database 
command that’s available from the Windows Start menu as described in figure 
2-1. When you select this item on a Windows system, a DOS window will be 
displayed that indicates that the Oracle service is stopping. Then, the DOS 
window will display a message when the Oracle service has successfully 
stopped. Although this doesn’t stop the database listener, the database listener 
won’t be able to return any results unless the database service is running. 

When you’re running the Oracle Database on your own computer for 
training purposes, you can stop the database whenever you want. However, if a 
database is running in a production environment, you should make sure that all 
users are logged off and that no applications are using the database before you 
stop the database. 

The easiest way to start the database service and listener is to use the Start 
Database command that’s available from the Windows Start menu. When you 
select this command on a Windows system, a DOS window will be displayed 
that indicates the status of the Oracle listener and service. 


Chapter 2 How to use Oracle SQL developer and other tools 


How to stop the database 
e Start>All Programs»Oracle Database 10g Express Edition» Stop Database 


The DOS window that’s displayed when the database is being stopped 
Stop Datsbase a 


Afi 
Ct \orac leac\app oracle \prodagt*16.2.8\server\BINonet stop OracleServicexE àil 
The OracleServiceRE service is stopping 
The OracieSeeviceXE servive was stopped successfully Fa 


Ct \orac lexe \app oracle \prodeotSi@.2 _G@\server\BIN> 


Æ 


How to start the database 
e Start>All Programs»Oracle Database 10g Express Edition Start Database 


The DOS window that’s displayed when the database is started 


(> sarac lose sapphorad le\product. 2 @\ycerverSf!N net start OracleXETNSListener t 
he requested cervite hes already been started 


ore help is available by typing NET NELPMSG 2182. | 


poracie\product\16.2.8\server\BIM>net start OracleServicekE 
iceKE servioe is starting 
Jiee XE service was started successfully 


>> worac lexo\4ppvordc le \product 16.2 server JIN), 


al 


Description 


e After you install the Oracle Database, the database service and database listener will 
start automatically each time you start your computer. The database service can also be 
referred to as the database server or the database engine. 


e To stop or start the database server and listener, you can use the commands that are 
available from the Windows Start menu. 


e When Oracle Database 11g Express Edition becomes available, you should be able to 
use a similar technique to stop and start that version of the Oracle Database. 


Figure 2-1 How to start and stop the database service 
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How to use the Database Home Page 


Figure 2-2 shows how to use a web-based tool known as the Database 
Home Page to work with an Oracle database. This tool is installed when you 
install the Oracle Database as described in appendix A, and it’s useful for 
handling some tasks. In particular, it provides a way to create a new user for the 
database. 

To begin, you can start the Database Home Page by selecting the Go To 
Database Home Page command from the Windows Start menu. Then, you can 
log in as the system user. If you followed the advice of appendix A, that means 
you'll use “system” as the username and “system” as the password. 

After you’ve logged in, you can use the Create User command and the 
resulting web page to create a new user. When you create a new user, you 
provide a username and a password. You can also limit the types of tasks that 
the user will be able to do. In chapter 12, you can learn more about this. But for 
now, you don’t need to restrict any of the user’s privileges when you create a 
new user. 

You can get a feel for the functionality that’s available from the Database 
Home Page by browsing through its Administration, Browser, SQL, and Utili- 
ties menus. If you do that, you’ll see that you can use this tool to accomplish a 
wide range of tasks. For most tasks, though, it’s easier to use the SQL Devel- 
oper tool that’s described later in this chapter. 
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The Database Home Page 
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How to start the Database Home Page 


e From the Windows Start menu, select All Programs»Oracle Database 10g Express 
Edition»Go To Database Home Page. Then, use the Database Login page to log in. 


How to use the Database Home Page to create a user for a database 


e Log in as the system user, select the Administration» Database Users»Create User 
command, and use the resulting web page to create the user. 


Description 


e The Database Home Page is a web-based tool that’s installed with the Oracle Database. 
You can use it to work with an Oracle Database. 


e After you log in, you can use the Administration, Browser, SQL, and Utilities menus to 
work with the database. 


e For most tasks, it’s easier to use the SQL Developer tool that’s described later in this 
chapter. 


Figure 2-2 How to use the Database Home Page 
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How to use SQL*Plus 


Figure 2-3 shows how to use a command-line tool known as SQL*Plus to 
work with the database. The SQL*Plus tool has been around since the earliest 
days of the Oracle database, and many Oracle developers still use it. However, 
the newer SQL Developer tool described later in this chapter is easier to learn 
and use than the SQL*Plus tool. That’s why this chapter and the rest of this 
book shows how to work with the SQL Developer tool. 

Still, there may be times when it makes sense to use SQL*Plus. If, for 
example, you are working on a computer that has SQL*Plus installed but 
doesn’t have SQL Developer installed, you may need to use SQL*Plus to run 
some SQL statements. Or, you may need to develop a batch file that uses 
SQL*Plus to run one or more SQL scripts. Figure A-5 of appendix A, for 
example, shows how to run a batch file that starts SQL*Plus and uses it to run 
four SQL scripts. In that case, of course, it makes sense to use SQL*Plus. 

If you need to start SQL*Plus, you can do that by selecting the Run com- 
mand from the Start menu, entering “sqlplus”, and selecting the OK button. 
Then, you can connect to the database as a user by entering the username and 
password. In this figure, for example, I started by connecting as the AP user. 

Once you’re connected to the database, you can run SQL statements. To do 
that, you type the SQL statement followed by a semicolon and press the Enter 
key. Then, if the statement selects data, SQL*Plus will display the data. In this 
figure, for example, I entered a SQL statement that displays the vendor_name 
for the vendor with an id of 11. 

At any time, you can connect to the database as a different user by entering 
the CONNECT command. When you enter this command, SQL*Plus prompts 
you for a username and password. In this figure, for example, I entered the 
CONNECT command to connect as the OM user. 
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The SQL*Plus tool 
| _CAoraclexe\app\oracle\product\ 


BQL*Plus: Release 14.2.4.1.8 — Production on Thu Sep 13 15:45:37 2007 


x 


Copyright <cc> 1982, 2065, Oracle. AlL rights reserved. 


Enter user-name: ap 
nter password: 


Connected to: 
Oracle Database 1@g Express Edition Release 14.2.4.1.4 — Production 


SQL> select vendor_name from vendors where vendor_id = 11; 


J ENDOR_NAME 


Bill Marvin Electric Inc 


SQL> connect 

Enter username: om 
Enter password: 
tannected. 


SQL? 


Description 


e SQL*Plus is a command-line tool that’s installed with the Oracle Database. You can use 
it to work with an Oracle Database. 


e To start SQL*Plus, select the Run command from the Start menu, enter “sqlplus”, and 
select the OK button. 


e To connect to a database, enter the username and password. If necessary, you can enter 
the CONNECT command to have SQL*Plus prompt you for a username and password. 


e Torun a SQL statement, type it, type a semicolon, and press the Enter key. 


e For most tasks, it’s easier to use the SQL Developer tool that’s described later in this 
chapter. 


Figure 2-3 How to use SQL*Plus 


54 Section 1 An introduction to SQL 


How to use SQL Developer to work 
with a database 


Oracle SQL Developer is a free graphical tool that makes it easy to work 
with Oracle databases, and it’s the tool that we recommend for working with an 
Oracle database. As you will see, this tool makes it easy for you to review or 
modify the design of a database. 

As of press time for this book, the current version of SQL Developer is 
version 1.5, so that’s the version presented in this chapter. However, with some 
minor variations, the skills presented in this chapter should work for later 
versions as well. 

When you use SQL Developer, you can connect to any Oracle Database 
version 9.2.0.1 or later. To connect to earlier versions, you need to use another 
tool such as the SQL*Plus tool described in the previous figure. 


How to create a database connection 


Before you can work with a database, you need to create a connection to the 
database. When you start SQL Developer, the Connections window displays all 
available database connections. To create a new connection, you can use the 
procedure described in figure 2-4. 

If you have installed the software for this book as described in figure A-5 of 
appendix A, the AP, OM, and EX users with passwords of AP, OM, and EX will 
be available on your system. As a result, you will be able to create connections 
for these three users. Specifically, we suggest that you use the AP user to work 
with the tables in the AP schema, the OM user to work with the tables in the 
OM schema, and the EX user to work with the tables in the EX schema. 

When you create a database connection, you should note that the usernames 
and passwords are not case-sensitive. As a result, it doesn’t matter if you enter 
the usernames and passwords in uppercase or lowercase. In this figure, for 
example, I entered the usernames and passwords in lowercase because it’s easier 
to type in lowercase. 


How to export or import database connections 


If you want to copy several database connections from one computer to 
another, you can export the database connections to an XML file as described in 
this figure. Then, you can use this XML file to import those database connec- 
tions on another computer. For example, I exported the database connections for 
the AP, OM, and EX users to this file: 

c:\murach\oracle_sql\db_setup\connections.xnl 
As a result, if you want, you can use the technique shown in this figure to 
import the connections that are stored in this file. 
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The dialog box for creating database connections 
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How to create a database connection 


1. Right-click on the Connections node in the Connections window and select the New 
Connection command to display the dialog box for creating database connections. 


2. Enter a connection name, username, and password for the connection. 


3. Click the Test button to test the connection. If the connection works, a success message 
is displayed above the Help button. 


4. Click the Save button to save the connection. When you do, the connection will be 
added to the dialog box and to the Connections window. 


How to export or import database connections 


e To export database connections, right-click the Connections node, select the Export 
Connections command, and use the resulting dialog box to select the connections that 
you want to export and to specify the path and filename for the XML file for the connec- 
tions. 


e To import connections, right-click the Connections node, select the Import Connections 
command, and use the resulting dialog box to navigate to the XML file for the connec- 
tions. Then, select the connections that you want to import. 


Figure 2-4 How to create, export, or import a database connection 
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How to navigate through the database objects 


Figure 2-5 shows how to navigate through the database objects that are 
available to the user that corresponds with the current database connection. 
These database objects include tables, views, indexes, and so on. For this 
chapter, however, you can focus on the tables. Later in this book, you’ll learn 
more about views, indexes, and other database objects. 

When you expand a connection for the first time in a session, you must 
enter the password for the username in the Connection Information dialog box. 
In this figure, for example, I clicked on the plus sign (+) to the left of the node 
for the AP connection. Then, SQL Developer prompted me for the password for 
the user named AP. When I entered the correct password, SQL Developer 
expanded the AP connection and displayed a SQL Worksheet window for the 
AP connection. 

Once you expand a connection, you can navigate through the objects that 
are available for the user that corresponds to the connection. To do that, you can 
click on the plus (+) and minus (-) signs to the left of each node to expand or 
collapse the node. In this figure, for example, I expanded the Tables node to 
view all of the tables available to the AP user. 

To work with a node or an object, you can right-click on the object to 
display a context-sensitive menu. Then, you can select a command from the 
resulting menu. For example, you can right-click on the node for the AP con- 
nection to display a list of commands for working with that connection. 
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The Connection Information dialog box 


P connection Information 


The tables available to the AP user 
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Description 


e Each connection provides access to the database objects that are available to the user 
that corresponds with the connection. These database objects include tables, views, and 
so on. 


e Before you can expand a connection for the first time in a session, you must enter the 
correct password for the connection. Once you enter a password for a connection, you 
can navigate through the nodes for the database objects. You can also use the SQL 
Worksheet window to enter and run SQL statements. 


e To navigate through the database objects for a connection, click on the plus (+) and 
minus (-) signs to the left of each node to expand or collapse the node. 


e To work with a node or an object, right-click on the node or object and select a command 
from the resulting menu. 


Figure 2-5 How to navigate through the database objects 
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How to view the column definitions for a table 


To view the column names for a table, you can expand the node for the 
table. In figure 2-6, for example, I expanded the node for the Vendors table so 
the names for all of columns in the table are displayed below the table. 

To view the column definitions for a table, you can click on the table name 
to display a table that contains the definition of each column. In this figure, for 
example, the right-hand window shows a table that contains the definitions for 
each column in the Vendors table. For each column, this table shows the column 
name, the data type, an indication of whether or not the column can contain null 
values, the default value, the position of the column within the table, whether or 
not the column is a primary key, and any column comments. By default, the 
columns are displayed in the sequence in which they were created. 


How to view the data for a table 


To view the data for a table, you just click on the Data tab after you display 
the column definitions for the table. By switching back and forth between the 
Column and Data tabs, you can quickly see how the data corresponds to the data 
definitions. 

You can also use the Data tab to modify the data in a row, and you can use 
the Insert and Delete buttons at the start of this tab to add rows or delete rows. 
Then, if you want to commit the changes to the table, you can click on the 
Commit Changes button. Or, if you want to rollback the changes, you can click 
on the Rollback Changes button. In chapter 7, you’ll learn more about commit- 
ting and rolling back changes. 
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How to view the column definitions for a table 
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How to view the column definitions for a table 


e To view the column names for a table, expand the Tables node and then expand the node 
for the table you wish to view. This displays the columns in the Connections window. 


e To view the column definitions for a table, click on the name of the table in the Connec- 
tions window. This displays detailed information about the columns of the table in the 
window to the right of the Connections window. 


e By default, the columns are displayed in the sequence in which they were created. 


How to view the data for a table 


e Click on the Data tab in the window that’s displaying the column definitions for the 
table. 


Figure 2-6 How to view the column definitions and data for a table 
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How to edit the column definitions 


If you want to edit a column definition, you can use one of the techniques 
described in figure 2-7 to display the Edit Table dialog box. Then, you can use 
this dialog box to add a column, delete a column, or modify a column. 

If you want to display additional information about a column, you can select 
the column by clicking on it. Then, additional properties are displayed in the 
Column Properties group that’s displayed on the right side of the dialog box. In 
this figure, for example, the properties for the DefaultTermsID column are 
displayed. In addition, a default value of 3 has been entered for this column. 
Note that the properties that are available change depending on the data type of 
the column. For a column with the VARCHAR2 data type, for example, the 
properties also indicate the length of the column. You’ll learn more about that in 
chapter 8. 

Most of the time, you won’t want to use SQL Developer to edit the column 
definitions for a table. Instead, you’ll want to edit the scripts that create the 
database so you can easily recreate the database later. However, if necessary, 
you can use SQL Developer to edit the column definitions for a table. In chapter 
10, you’ll learn more about creating and modifying the column definitions for a 
table. 
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Description 


è To edit the definition of a table, right-click on the table name, select the Edit command, 
and use the Edit Table dialog box to modify the table. 


e You can use the Edit Table dialog box to add a column, delete a column, or change the 
properties of an existing columm such as the name, data type, default value, and so on. 


e You can also use the Edit Tabie dialog box to add, delete, or modify a table-level con- 
straint, an index, or a table-level comment. 


e For more information about creating tables, see chapter 10. 


Figure 2-7 How to edit the column definitions 
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How to use SQL Developer to run SQL 
statements 


Besides letting you review the design of a database, SQL Developer is a 
great tool for entering and running SQL statements. That’s what you’ll learn 
how to do next. 


How to enter and execute a SQL statement 


Figure 2-8 shows how to use the SQL Worksheet window to enter and 
execute a SQL statement. The easiest way to open a SQL Worksheet window is 
to use the drop-down list that’s available from the Open SQL Worksheet button 
on the toolbar. First, you can click on the arrow to the right of this button to 
display all connections. Then, you can select the connection you want to use. 
This opens a Worksheet for that connection. But first, if the connection hasn’t 
been used in the current session, you may be prompted to enter the password for 
the connection. 

Once you open a SQL Worksheet, you can use standard techniques to enter 
or edit a SQL statement. As you enter statements, you’ll notice that SQL 
Developer automatically applies colors to various elements. For example, 
keywords are displayed in blue. This makes your statements easier to read and 
understand and can help you identify coding errors. 

When you enter SQL statements, you’ll notice that SQL Developer auto- 
matically displays a drop-down list that helps you enter SQL statements. This 
feature often provides help for entering SQL keywords, table names, column 
names, and so on. In this figure, for example, SQL Developer displayed a drop- 
down list after I entered the ORDER BY keywords and pressed the spacebar. At 
this point, you can easily select a column name from the drop-down list. 

If you experiment with this code completion feature, you’ll find that SQL 
Developer doesn’t display column names automatically until you enter the name 
of the table that’s used by the statement. As a result, if you want to use the code 
completion feature, you may want to enter the table name before you enter the 
column names. 

Most of the time, SQL Developer automatically displays the drop-down list 
after you enter some code followed by a space. Usually, that’s what you want. 
However, there are times when you may want to manually prompt SQL Devel- 
oper to display the drop-down list. To do that, you can press the Ctrl key and the 
spacebar at the same time (Ctrl+spacebar). 

In addition, you can use SQL Developer to automatically comment or 
uncomment a line. To do that, you can move the insertion point into the line. 
Then, you can press Ctrl key and the front slash at the same time (Ctrl+/). 

To execute a single SQL statement like the one in this figure, you can press 
F9 or click the Execute Statement button in the toolbar for the SQL Worksheet 
window. If the statement returns data, that data is displayed in the Results tab. In 
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VENDOR_NAME 
1 ASC Signs 
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3 Abbey Office Furnishings 
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10 Bertelsmann Industry Svcs. Inc 
11 Bill Jones 
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To open a new SQL Worksheet window, drop down the list from the Open SQL 


Worksheet button. Then, select the connection you want to use. 


To change the connection for the worksheet, use the Connections list. 
To enter a SQL statement, type it into the SQL Worksheet window. 
As you enter the text for a statement, the SQL Worksheet window applies color to 


various elements, such as SQL keywords, to make them easy to identify. 


As you enter the text for a statement, you can use the code completion feature to select 


SQL keywords, table names, column names, and so on. 


To manually display the code completion list, press Ctrl+spacebar. 
To comment out a line or to uncomment a line, press Ctrl+/. 
To execute a SQL statement, press the F9 key or click the Execute Statement button in 


the toolbar. If the statement retrieves data, the data is displayed in the Results tab of the 


SQL Worksheet window. 


Figure 2-8 
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this figure, for example, the result set returned by the SELECT statement is 
displayed. If necessary, you can adjust the height of the Results pane by drag- 
ging the bar that separates the SQL Worksheet window from the Results tab. 


How to work with the Snippets window 


Figure 2-9 shows how to use the Snippets window to enter a snippet of code 
into the SQL Worksheet window. To start, if the Snippets tab isn’t displayed on 
the right side of the SQL Developer window, you can select the Snippets 
command from the View menu to display the Snippets window. Otherwise, you 
can display the Snippets window by clicking on the Snippets tab that’s dis- 
played on the right side of the SQL Developer window. Then, you can use the 
drop-down list at the top of the Snippets window to select a category of snip- 
pets, and you can drag a snippet from the Snippets window into the SQL 
Worksheet window. At that point, you can edit the snippet code so it’s appropri- 
ate for your SQL statement. 

In this figure, for example, I dragged the COUNT(*) and SUM(expr) 
snippets into the SQL Worksheet window. Then, I edited the SUM(expr) snippet 
to replace the expr placeholder with a valid expression. 

For now, don’t worry if you don’t understand the SQL statement presented 
in this figure. The main point is that you can use the Snippets window to enter a 
variety of SQL code. As you learn more about SQL statements, you’ll see how 
useful this can be. 
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'VARIANCE(expr) 


e Ifthe Snippets tab isn’t displayed on the right side of the SQL Developer window, you 


can display it by selecting the Snippets command from the View menu. 


e To display the Snippets window, click on the Snippets tab that’s displayed on the right 
side of the SQL Developer window. 


e To display another category of snippets, select the category from the drop-down list at 
the top of the Snippets window. The snippets are organized in 10 categories including: 
Date Formats, Number Formats, Date/Time Functions, Number Functions, Character 
Functions, Conversion Functions, and Pseudocolumns. 


e To enter a snippet into your code, drag the snippet from the Snippets window into the 
SQL Worksheet window. Then, if necessary, edit the snippet code so it’s appropriate for 


your SQL statement. 


e The Snippets window will become more useful as you learn more about formats and 


functions. 


Figure 2-9 


How to use the Snippets window 
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How to handle syntax errors 


If an error occurs during the execution of a SQL statement, SQL Developer 
displays a dialog box that includes the error number, a brief description of the 
error, and the location of the error in your code. In figure 2-10, for example, the 
dialog box displays an error number of “ORA-00942” and a brief description 
that says “table or view does not exist.” This dialog box also indicates that the 
error occurred at line 2, column 5, where a table or view named Vendor is 
referenced. 

In this example, the problem is that the Vendor table doesn’t exist in the 
database. To fix the problem, you need to edit the SQL statement so the table is 
Vendors instead of Vendor. Then, you should be able to successfully run the 
SQL statement. 

This figure also lists some other common causes of errors. As you can see, 
most errors are caused by incorrect syntax. However, it’s also possible that you 
will get an error if you are connected as the wrong user. If, for example, you are 
connected as the EX user and you try to run a statement that references tables in 
the AP schema, you may get an error. Regardless of what’s causing the problem, 
you can usually identify and correct the problem without much trouble. In some 
cases, though, it may be difficult to figure out the cause of an error. Then, you 
can usually get more information about the error by searching the Internet. 
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Common causes of errors 

e Misspelling the name of a table or column 

e Misspelling a keyword 

e Omitting the closing quotation mark for a character string 
e Being connected as the wrong user 


Description 


e Ifan error occurs during the execution of a SQL statement, SQL Developer displays a 
dialog box that includes an error code, a brief description of the error, and the location of 
the error in the code. 


e Most errors are caused by incorrect syntax and can be corrected without any additional 
assistance. Otherwise, you can usually get more information about an error by searching 
for the error code or description on the Internet. 


Figure 2-10 How to handle syntax errors 
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How to open and save SQL statements 


After you get a SQL statement working the way you want it to work, you 
may want to save it. Then, you can open it and run it again later or use it as the 
basis for a new SQL statement. To save a SQL statement, you can use the 
standard Windows techniques shown in figure 2-11. 

To open a file that has been saved, you use the Open command. In this 
figure, for example, the Open dialog box shows the SQL statements that have 
been saved for chapter 3. They are saved in the scripts\ch03 directory that is 
created when you download and install the source code for this book. The 
screen in this figure shows the tabs for three files that have been opened for that 
chapter. Note that the names of these files have the sql extension. 

After you open two or more SQL worksheets, you can switch between the 
SQL statements by clicking on the appropriate tab. Or, you can select the SQL 
statement from the file list that’s available just above the Connections list. Then, 
you can cut, copy, and paste code from one SQL statement to another. 

When you open a saved SQL file, SQL Developer doesn’t set a connection 
for the SQL statement within the file. As a result, you must specify a connection 
before you can run the SQL statement by using the Connections list. If you 
don’t do that, SQL Developer will prompt you to select a connection when you 
try to run the statement. 

To save a new SQL statement in a new file, you use the Save command. To 
save a modified SQL statement in its original file, you also use the Save com- 
mand. And to save a modified SQL statement in a new file, you use the Save As 
command. 

To set the default directory that you want to use for saving new SQL state- 
ments, you can use the Tools»Preferences command that’s described in this 
figure. Note, however, that there’s no way to set the default directory for open- 
ing files. 

For both the Open and Save dialog boxes, you can specify a recently used 
directory by clicking on its icon. In this figure, for example, the Open dialog 
box shows that the ch02, ch03, and db_setup directories have all been used 
recently. As a result, you can easily specify one of these directories by clicking 
on it. 
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The Open File dialog box 
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Description 


e To open a SQL file, click the Open button in the toolbar, press Ctrl+O, or select the 
FileOpen command. Then, use the Open dialog box to locate and open the SQL file. 


e To specify a connection for a SQL statement that you open, select a connection from the 
Connections list (see figure 2-8). Otherwise, when you try to run the statement, SQL 
Developer will display a dialog box that prompts you to select a connection. 

e To switch between open statements, select the appropriate tab. Or, click on the drop- 


down arrow that’s displayed to the right of the SQL Worksheet tabs, and select the file 
name from the file list. 


e To cut, copy, and paste code from one SQL statement to another, use the standard 
Windows techniques. 


e To save a SQL statement, click the Save button in the toolbar, press Ctrl+S, or select the 
File»Save command. Then, if necessary, use the Save dialog box to specify a file name 
for the SQL statement. 


e To change the default directory for new statements that you want to save, use the 
Tools»Preferences command. Then, expand the Database node, click on the Worksheet 
Parameters node, and change the default path for scripts. 


e To specify a recently used directory in an Open or Save dialog box, click on the icon for 
the recently used directory. 


Figure 2-11 How to open and save SQL statements 
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How to enter and execute a SQL script 


A SQL script is a file that contains one or more SQL statements. So far in 
this chapter, each of the SQL files that has been presented has been a SQL script 
that contains just one SQL statement. However, a SQL script typically contains 
multiple statements. 

When you code multiple SQL statements within a script, you must code a 
semicolon at the end of each SQL statement. For example, figure 2-12 shows a 
script that contains two SQL statements. Then, you can press F5 or click the 
Run Script button to execute all of the SQL statements that are stored in the 
script. When you do, the results of the script will be displayed in the Script 
Output tab. 

However, if you want to execute a single SQL statement that’s stored within 
a script, you can do that by moving the insertion point into the statement and 
pressing the F9 key or clicking the Execute Statement button in the toolbar. 
Then, if the statement retrieves data, the data is displayed in a Results tab like 
the one in figure 2-8. In this figure, for example, the insertion point is in the first 
SQL statement, and this statement is a SELECT statement that retrieves data. As 
a result, if you press the F9 key, the result set is displayed in the Results tab. 
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A SQL script and Its resuits 
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Description 


¢ A SQL script is a file that contains one or more SQL statements. When you code a script 
that contains more than one statement, you must code a semicoion at the end of each 
Statement. 

e Torun the entire SQL script, press the F5 key or click the Run Script button that’s 
located just to the right of the Execute Statement button. The results are displayed in the 
Script Output tab. 

e To execute one SQL statement within a script, move the insertion point into the state- 
ment. Then, press the F9 key or click the Execute Statement button in the toolbar. If the 
statement retrieves data, the data is displayed in the Resuits tab. 


Figure 2-12 How to enter and execute a SQL script 
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How to use the SQL Reference manual 


Figure 2-13 shows how to use another useful tool for working with the 
Oracle Database: the SQL Reference manual. This manual is available as a 
searchable PDF file that contains detailed technical information about the SQL 
statements that work with Oracle Database. You can use this PDF file to quickly 
look up a wide variety of information about the Oracle Database including 
information about its SQL statements and functions. 


How to view the manual 


If you have the PDF file for the SQL Reference manual installed on your 
computer as described in appendix A, you can open the manual by using the 
Windows Explorer to double-click on the PDF file. This will open the PDF file 
in the Adobe Reader. Of course, this assumes that you have the Adobe Reader 
installed on your system. If you don’t, you can download it for free from 
Adobe’s web site. 


How to look up information 


Once you’ve opened the PDF file for the SQL Reference manual in the 
Adobe Reader, there are several ways that you can look up information. In 
short, you can use any standard techniques for working with a PDF file to work 
with this manual. If you need help, you can use the Adobe Reader’s Help menu 
to learn more about using Adobe Reader. 

One easy way to look up information is to scroll through the table of 
contents. To do that, click on the plus (+) and minus (-) signs in the left column 
to expand or collapse the topics. Then, when you find the topic you want, click 
on it to display the topic in the right pane. Once you do that, you can scroll up 
or down to view the entire topic. In this figure, for example, I expanded the 
topic named “SQL Statements: SAVEPOINT to UPDATE,” and I clicked on the 
SELECT topic to display information about the SQL SELECT statement. 

Another easy to way to look up information is to search for a specific word 
or phrase. To do that, type the word or phrase in the Find text box and press the 
Enter key. Then, you can click on the Find Next or Find Previous button to find 
the next or previous occurrence of the word or phrase. In this figure, for ex- 
ample, I entered “select statement” into the Find text box and pressed the Enter 
key. 
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How to use the Oracle Database SQL Reference manual 


Bi Gracte Database SOL Reference Mangal, prit -Adobe Rewder 
Fie Edt View Document Tosa iindon rep 


Se é © & elim = om} SG 


f] Bookmarks 


SOL Stat ry Use a or eubquery to roteleve data from one ar more tablas, objact 

CREATE SYNONYM tabioe, viown, 1 OF materialized viows, 

lo CREATE TRIGGER lE part oe all of the ramuli of à SELECT abt manent ia aniv alana to mrt existing 

Bp] 46 SQL Ststemente: materialized view, then Oracle Database may ite the mabe alized view In place of ane 

CREATE LIBRARY lo or more tabiot epocified in the SELECT statement. This subseitutian ia called query 

CREATE SPFILE rewelte, It rakes place only if cort optimization is ensbied and the QUEAY_AEWRITE_ 
ENABLED parecer is set to TRUE. To deterring whether query write hat occurred, 

mf] 17 SQL Stetementa: use the EXPLAIN DLAN statoment 

CREATE TYPE to 

DROP ROLLBACK Sor Alto: 

SEGMENT n Chapter 9, “SQL Quertes and Subquerias'' for genera? information 

BH] 19 SOL Statements: TA 

DROP SEQUENCE to a Onede Datctme Dott Aitrefrouning Guide for sore infarrmnaiion on 

ROLLBACK materialized views and query rowslte 


19 SQL Statemanta: a EXPLAIN PLAN on page 18-20 
SAVEPOINT to 
UPDATE 
fy SAVEPOINT For you to sulact date from s tabla or matocialleod view, the mblo or materiatized view 
must be in your own scheme or you crust have the SELECT privilege on the table or 
tha mristeed view. 
For you to slart rows froin the bare mblas of a view: 
e You must heve the SELECT privilege on thw view, and 
e Whoever owne the schort corrtaining the view raut Rave the BELBCT privilege 
an tho bape tables, 
The SELECT ASY TARLA system privilege alao allows you to select date from any table 
Ot any materialized view or the bana table of arty view. 
Liami: an Orada Flashback ret oun eir m aym S pE eeh you must 
SELECT privilege on the objects in the select Har. da addition, eather vou must 
4 TRUNCATE TABLE 3i have FLASHBACK object privilege on the objects in the select list, or you must have 
FLASHBACK ANY TABLE system privilege. 


Description 


e The Oracle Database SQL Reference manual is available as a searchable PDF file that 
contains detailed technical information about the SQL statements that work with the 
Oracle Database. 

e Ifyou have the PDF file for the SQL Reference manual installed on your computer, you 
can Start it by using the Windows Explorer to double-click on the file. 

e To scroll through the table of contents, click on the plus (+) and minus {-) signs to 
expand or collapse the topics. When you find the topic you want, click on it to display 
the topic in the right pane. Then, you can scroll up or down to view the entire topic. 

e To search for a particular word or phrase, type the word or phrase in the Find text box in 
the toolbar and press the Enter key. Then, you can click on the Find Next or Find Previ- 
ous button to find the next or previous occurrence of the word or phrase. 


Note 


e The name of the Oracle Database SQL Reference manual varies slightly from one 
release to another. 


Figure 2-13 How to use the Oracle Database SQL Reference manual 
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Perspective 


In this chapter, you learned how to use the tools that you need to work with an 
Oracle database and to enter and execute SQL statements. With that as back- 
ground, you’re ready to go on to the next chapter, where you'll start learning the 
details of coding your own SQL statements. To start, you’ll learn the details for 
coding a SELECT statement that retrieves data from a single table. 

Before you go on to the next chapter, though, we recommend that you install 
all of the software for this book on your computer as described in Appendix A. You 
should also do the exercises that follow. That will get you off to a good start with 
Oracle Database and the Oracle Database tools. 


Terms 


Oracle Database 
database service 
database server 
database engine 
database listener 
Database Home Page 
SQL*Plus 

database object 

Oracle SQL Developer 
snippet 

SQL script 

Oracle Database SQL Reference manual 


Before you start the exercises... 


Before you start the exercises for this chapter, you need to install Oracle 
Database Express Edition and Oracle SQL Developer. You also need to 
download and install the source files for this book, and you need to create the 
users and tables for this book. The procedures for doing all of these tasks are 
provided in appendix A. 


Exercises 


In these exercises, you’ll use SQL Developer to review the tables in the AP 
schema that’s used throughout this book. In addition, you’ll use SQL developer 
to enter SQL statements and run them against these tables. 


Make sure Oracle Database is running 

1. Use the procedure in figure 2-1 to start the database service for Oracle 
Database. If it is already running, you'll get a message that confirms that. 
Otherwise, the database service will be started. Either way, you can close the 
DOS window that’s opened. 
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Use SQL Developer to review the Accounts Payable database 


2. 


Start Oracle SQL Developer. If you created a menu or desktop shortcut when 
you installed SQL Developer, you can use that shortcut now. Otherwise, you can 
use the Windows Explorer to find and double-click on the sqldeveloper.exe file. 


Create the AP, EX, and OM connections as described in figure 2-4. When you’re 
done, the Connections window should display three connections: AP, EX, and 
OM. 


In the Connections window, click on the AP connection to expand it. When 
you're prompted for a password, enter “ap”. That will expand the connection so 
you can see all of the database objects in the AP schema. 


Use the techniques in figures 2-5 and 2-6 to navigate through the database 
objects and view the column definitions for at least the Vendors and Invoices 
tables. 


Use the technique in figure 2-6 to view the data for the Vendors and Invoices 
tables. 


Use SQL Developer to enter and run SQL statements 


7. 


10. 


11. 


12. 


Use the technique in figure 2-8 to open a SQL Worksheet window for the AP 
connection. Then, enter and run this SQL statement: 


SELECT vendor_name FROM vendors 


Use the code completion feature described in figure 2-8 to enhance this SQL 

statement so it includes an ORDER BY clause and some additional columns like 

this: 

SELECT vendor_name, vendor_addressl, vendor_city, vendor_state, 
vendor zip code 


FROM vendors 
ORDER BY vendor name 


Then, run the statement. 


Move the cursor into the ORDER BY clause and press Ctrl+/ to comment out 
the line. Then, press Ctrl+/ again to uncomment the line. 


Delete the e at the end of vendor_zip_code and run the statement again. Note 
that this syntax error is handled as in figure 2-10. 


Open another Worksheet window, and use the COUNT and SUM snippets as 
shown in figure 2-9 as you enter this statement: 
SELECT COUNT(*) AS number of invoices, 


SUM(invoice total) AS grand invoice total 
FROM invoices 


Then, run the statement. 


Use the Tools>Preferences command to set the default path for scripts as 
described in figure 2-11. Then, click on the tab for the Worksheet window of 
exercise 8, click the Save button to save this statement, and note the directory in 
the Save dialog box. Next, click the Cancel button in the Save dialog box to 
cancel the command. 
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Use SQL Developer to open and run scripts 


13. 


14. 


15. 


16. 


Use the technique in figure 2-11 to open the select_vendor_city_state script 
that’s in the c:\murach\oracle_sql\scripts\ch02 directory. Notice that this script 
contains just one SQL statement. Then, run the statement. Because you didn’t 
specify a connection for this statement, SQL Developer will ask you to select 
one before it runs the statement. 


Click on the Open button. Note that the recently used directories including the 
ch02 directory are shown on the left side of the Open dialog box. Then, click 
on the ch02 directory to display the files that are stored in this directory. Next, 
click the Cancel button to close this dialog box. 


Open the select_vendor_total_due script that’s in the ch02 directory. Note that 
this opens another tab. Next, select the AP connection from the connection list 
and run this script. 


Open the select_vendor_information script that’s in the ch02 directory. Notice 
that this script contains two SQL statements that end with semicolons (scroll 
down if you need to). Then, move the insertion point to the first statement and 
press F9 to run that statement. Next, move the insertion point to the second 
statement and press F9 to run that statement. Last, press F5 or click the Run 
Script button to run both of the statements that are stored in this script. If you 
scroll through the Script Output window, you will see the results of the two 
SELECT statements that were run. 


Close and restart SQL Developer 


17. 


18. 


19. 


20. 


Continue to experiment on your own. Make sure to leave at least one saved 
script open. When yov’re ready to end this session, use the File>Exit command 
or click on the Close button in the upper right corner of the SQL Developer 
window. 


Restart SQL Developer. When it starts, notice that all of the saved scripts that 
you left open are automatically opened. However, any unsaved scripts that you 
entered are lost. 


Run one of the open scripts. Note that you have to select a connection and 
provide a password for the connection before the script will run. 


Exit from SQL Developer. 
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The essential SQL skills 


This section teaches you the essential SQL coding skills for working with the 
data in an Oracle database. The first four chapters show you how to retrieve data 
from a database using the SELECT statement. In chapter 3, you’ll learn how to 
code the basic clauses of the SELECT statement to retrieve data from a single 
table. In chapter 4, you’ll learn how to retrieve data from two or more tables. In 
chapter 5, you’ll learn how to summarize the data that you retrieve. And in 
chapter 6, you'll learn how to code subqueries, which are SELECT statements 
that are coded within other statements. 

Next, chapter 7 shows you how to use the INSERT, UPDATE, and DELETE 
statements to add new rows to a table, to modify rows in a table, and to remove 
rows from a table. Then, chapter 8 shows you how to work with the various types 
of data that Oracle supports and how to use some of the built-in functions that 
Oracle provides for working with these data types. When you complete the six 
chapters in this section, you’ll have all the skills you need to code SELECT, 
INSERT, UPDATE, and DELETE statements. 
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How to retrieve data from a 
single table 


In this chapter, you’ll learn how to code SELECT statements that retrieve data 
from a single table. You should realize, though, that the skills covered here are 
the essential ones that apply to any SELECT statement, no matter how many 
tables it operates on and no matter how complex the retrieval. So you’ll want to 
have a good understanding of the material in this chapter before you go on to 
the chapters that follow. 
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An introduction to the SELECT 
statement 


To help you learn to code the SELECT statement, this chapter starts by 
presenting its basic syntax. Next, it presents several examples that will give you 
an idea of what you can do with this statement. Then, the rest of this chapter 
will teach you the details of coding this statement. 


The basic syntax of the SELECT statement 


Figure 3-1 presents the basic syntax of the SELECT statement. The syntax 
summary at the top of this figure uses some conventions that are used through- 
out this book. First, capitalized words are called keywords, and you must spell 
them exactly as shown, though you can use whatever capitalization you prefer. 
For example, the following are equivalent: “SELECT”, “Select”, “select” and 
“sELeCt”. Second, you must provide replacements for words in lowercase. For 
example, you must enter a list of columns in place of select_list, and you must 
enter a table name in place of table_source. 

Beyond that, you can choose between the items in a syntax summary that 
are Separated by pipes (|) and enclosed in braces ({}) or brackets ([]). And you 
can omit items enclosed in brackets. If you have a choice between two or more 
optional items, the default item is underlined. And if an element can be coded 
multiple times in a statement, it’s followed by an ellipsis (...). You’ll see ex- 
amples of pipes, braces, default values, and ellipses in syntax summaries later in 
this chapter. For now, compare the syntax in this figure with the coding ex- 
amples in figure 3-2 to see how the two are related. 

The syntax summary in this figure has been simplified so that you can focus 
on the four main clauses of the SELECT statement: the SELECT clause, the 
FROM clause, the WHERE clause, and the ORDER BY clause. Most of the 
SELECT statements you code will contain all four clauses. However, only the 
SELECT and FROM clauses are required. 

The SELECT clause is always the first clause in a SELECT statement. It 
identifies the columns you want to include in the result set. These columns are 
retrieved from the base tables named in the FROM clause. Since this chapter 
focuses on retrieving data from a single table, the FROM clauses in all of the 
statements in this chapter name a single base table. In the next chapter, though, 
you'll learn how to retrieve data from two or more tables. And as you progress 
through this book, you’ll learn how to select data from other sources such as 
views and expressions. 

The WHERE and ORDER BY clauses are optional. The ORDER BY clause 
determines how the rows in the result set are to be sorted, and the WHERE 
clause determines which rows in the base table are to be included in the result 
set. The WHERE clause specifies a search condition that’s used to filter the 
rows in the base table. This search condition can consist of one or more Boolean 
expressions, or predicates. A Boolean expression is an expression that evaluates 
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The simplified syntax of the SELECT statement 


SELECT select list 
FROM table source 
[WHERE search condition] 
[ORDER BY order_by list] 


The four clauses of the SELECT statement 
Clause Description 


Describes the columns that will be included in the result set. 
Names the table from which the query will retrieve the data. 


WHERE Specifies the conditions that must be met for a row to be included in the result set. This 
clause is optional. 


ORDER BY Specifies how the rows in the result set will be sorted. This clause is optional. 


Description 
e You use the basic SELECT statement shown above to retrieve the columns specified in 


the SELECT clause from the base table specified in the FROM clause and store them in 
a result set. 


e The WHERE clause is used to filter the rows in the base table so that only those rows 
that match the search condition are included in the result set. If you omit the WHERE 
clause, all of the rows in the base table are included. 


e The search condition of a WHERE clause consists of one or more Boolean expressions, 
or predicates, that result in a value of True, False, or Unknown. If the combination of all 
the expressions is True, the row being tested is included in the result set. Otherwise, it’s 
not. 


e Ifyou include the ORDER BY clause, the rows in the result set are sorted in the speci- 
fied sequence. Otherwise, the sequence of the rows is not guaranteed by Oracle. 


Note 


e The syntax shown above does not include all of the clauses of the SELECT statement. 
You'll learn about the other clauses later in this book. 


Figure 3-1 The basic syntax of the SELECT statement 
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to True or False. When the search condition evaluates to True, the row is in- 
cluded in the result set. 

In this book, we won’t use the terms “Boolean expression” or “predicate” 
because they don’t clearly describe the content of the WHERE clause. Instead, 
we'll just use the term “search condition” to refer to an expression that evaluates 
to True or False. 


SELECT statement examples 


Figure 3-2 presents five SELECT statement examples. All of these state- 
ments retrieve data from the Invoices table. 

The first statement in this figure retrieves all of the rows and columns from 
the Invoices table. This statement uses an asterisk (*) as a shorthand to indicate 
that all of the columns should be retrieved, and the WHERE clause is omitted so 
there are no conditions on the rows that are retrieved, You can see the results 
after this statement as they’re displayed by SQL Developer. Here, both horizon- 
tal and vertical scroll bars are displayed, indicating that the result set contains 
more rows and columns than can be displayed on the screen at one time. 

Notice that this statement doesn’t include an ORDER BY clause. Without 
an ORDER BY clause, Oracle doesn’t guarantee the sequence in which the rows 
are presented. They might be in the sequence you expect, or they might not. As 
a result, if the sequence matters to you, you should include an ORDER BY 
clause. 

The second statement retrieves selected columns from the Invoices table. As 
you can see, the columns to be retrieved are listed in the SELECT clause. Like 
the first statement, this statement doesn’t include a WHERE clause, so all the 
rows are retrieved. Then, the ORDER BY clause causes the rows to be sorted by 
the invoice_total column in ascending sequence. Later in this chapter, you’ll 
learn how to sort rows in descending sequence. 

The third statement also lists the columns to be retrieved. In this case, 
though, the last column is calculated from two columns in the base table 
(credit_total and payment_total), and the resulting column is given the name 
total_credits. In addition, the WHERE clause specifies that only the invoice with 
an invoice_id of 17 should be retrieved. 

The fourth SELECT statement includes a WHERE clause whose condition 
specifies a range of values. In this case, only invoices with invoice dates be- 
tween May 1, 2008 and May 31, 2008 are retrieved. In addition, the rows in the 
result set are sorted by invoice date. 

The last statement in this figure shows another variation of the WHERE 
clause. In this case, only those rows with an invoice_total greater than 50,000 
are retrieved. Since none of the rows in the Invoices table satisfies this condi- 
tion, the result set is empty. 
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A SELECT statement that retrieves all the data from the Invoices table 


SELECT * 
FROM invoices 


® invoice jA venbor_p lA invoice_numBeR | invoice_pate |) invoice_tota |A payment_totat | cREDIT_TOTAL 
34 QP58872 25-FEB-08 116.54 116.54 
34 Q545443 14-MAR-08 1083.58 1083.58 


110 PF-0608 11-4PR-08 20551.18 Qo 
110 F-02598 16-4PR-08 268814 268814 


(114 rows selected) 


A SELECT statement that retrieves three columns from each row, sorted 
in ascending sequence by invoice_total 


SELECT invoice number, invoice date, invoice total 
FROM invoices 
ORDER BY invoice total 

 INVOICE_NUMBER Y INVOICE_D4TE |A INVOICE_TOT&L 


1 25022117 24-MAY-08 
2 24863706 27-May-08 


3 24780512 29-MAY-08 
4 21-4748363 09-May-08 


(114 rows selected) 


A SELECT statement that retrieves two columns and a calculated value 
for a specific invoice 


SELECT invoice id, invoice total, 

(credit total + payment_total) AS total_credits 
FROM invoices 
WHERE invoice id = 17 


W INVOICE JO | INVOICE_ToTaL |$ TOTAL_CREDITS 


1 17 356.46 356.46 


A SELECT statement that retrieves all invoices between given dates 


SELECT invoice_number, invoice_date, invoice_total 

FROM invoices 

WHERE invoice_date BETWEEN '01-MAY-2008' AND '31-MAY-2008' 
ORDER BY invoice date 


INVOICE_NUMBER | INVOICE_DATE |] INVOICE_TOTAL 
1 7548908-20 O1-MaY-08 


2 4-321-2596 01-May-08 
3 4-327-7357 01-May-08 
4 4-342-6069 01-May-08 


(70 rows selected) 


A SELECT statement that returns an empty result set 


SELECT invoice number, invoice date, invoice total 
FROM invoices 
WHERE invoice total > 50000 


INVOICE_NUMBER |È INVOICE_DaTE |] INvOICE_TOTAL 


Figure 3-2 SELECT statement examples 
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How to code the SELECT clause 


Figure 3-3 presents an expanded syntax for the SELECT clause. The 
keywords shown in the first line allow you to restrict the rows that are returned 
by a query. You’ll learn how to code them in a moment. But first, you’ll learn 
various techniques for identifying which columns are to be included in a result 
set. 


How to code column specifications 


Figure 3-3 summarizes the techniques you can use to code column specifi- 
cations. You saw how to use some of these techniques in the previous figure. For 
example, you can code an asterisk in the SELECT clause to retrieve all of the 
columns in the base table, and you can code a list of column names separated by 
commas. Note that when you code an asterisk, the columns are returned in the 
order that they occur in the base table. 

You can also code a column specification as an expression. For example, 
you can use an arithmetic expression to perform a calculation on two or more 
columns in the base table, and you can use a string expression to combine two 
or more string values. An expression can also include one or more functions. 
You’ll learn more about each of these techniques in the topics that follow. 


Chapter3 How to retrieve data from a single table 


The expanded syntax of the SELECT clause 


SELECT [ALL|DISTINCT] 
column_specification [[AS] result_column] 
[, column specification [[AS] result_column] ] 


Five ways to code column specifications 
Source 


Base table value All columns * 


Column name column_name 


Calculated value Result of a concatenation String expression (see figure 3-5) 
Result of a calculation Arithmetic expression (see figure 3-6) 
Result of a scalar function Scalar function (see figure 3-7) 


Column specifications that use base table values 
The * is used to retrieve all columns 
SELECT * 


Column names are used to retrieve specific columns 
SELECT vendor name, vendor city, vendor state 


Column specifications that use calculated values 


An arithmetic expression is used to calculate balance_due 


SELECT invoice number, 
invoice_total - payment _total - credit_total AS balance due 


A string expression is used to derive full_name 
SELECT first_name || ' ' || last_name AS full_name 


Description 


e Use SELECT * only when you need to retrieve all columns from a table. Otherwise, list 
the names of the columns you need. 


e An expression is a combination of column names and operators that evaluate to a single 
value. In the SELECT clause, you can code arithmetic expressions, string expressions, 
and expressions that include one or more functions. 


e After each column specification, you can code an AS clause to specify the name for the 
column in the result set. See figure 3-4 for details. 


Note 


e The other elements shown in the syntax summary above let you control the number of 
rows that are returned by a query. You can use the DISTINCT keyword to eliminate 
duplicate rows. See figure 3-9 for details. 


Figure 3-3 How to code column specifications 
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How to name the columns in a result set 


By default, a column in a result set is given the same name as the column in 
the base table. You can specify a different name, however, if you need to. You 
can also name a column that contains a calculated value. When you do that, the 
new column name is called a column alias. Figure 3-4 presents two techniques 
for creating column aliases. 

The first technique is to code the column specification followed by the AS 
keyword and the column alias. This is the coding technique specified by the 
American National Standards Institute (ANSI, pronounced ‘ann-see’), and it’s 
illustrated by the first example in this figure. 

The second technique is to code the column specification followed by a 
space and the column alias. This coding technique is illustrated by the second 
example. Whenever possible, though, you should use the first technique since 
the AS keyword makes it easier to identify the alias for the column, which 
makes your SQL statement easier to read and maintain. 

When you code an alias, you must enclose the alias in double quotes if the 
alias contains a space or is a keyword that’s reserved by Oracle. In this figure, 
the first two examples specify an alias for the invoice_number column that uses 
two words with a space between them. 

In addition, these two examples specify an alias for the invoice_date column 
that uses a keyword that’s reserved by Oracle: the DATE keyword. If you don’t 
enclose this keyword in double quotes, you will get an error when you attempt 
to execute either of these SQL statements. When you enter a statement into SQL 
Developer, it boldfaces keywords that are reserved by Oracle. This makes it 
easy to identify Oracle keywords when you’re writing SQL statements. 

When you enclose an alias in double quotes, the result set uses the capitali- 
zation specified by the alias. Otherwise, the result set capitalizes all letters in the 
column name. In this figure, for instance, the first two columns in the first result 
set use the capitalization specified by the aliases. However, since no alias is 
specified for the third column, all letters in the name of this column are capitalized. 

When you code a column that contains a calculated value, it’s a good 
practice to specify an alias for the calculated column. If you don’t, Oracle will 
assign the entire calculation as the name, which can be unwieldy, as shown in 
the third example. As a result, you usually assign a name to any column that’s 
calculated from other columns in the base table. 
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Two SELECT statements that name the columns in the result set 


A SELECT statement that uses the AS keyword 


-- DATE is a reserved keyword. 

-- As a result, it must be enclosed in quotations. 

SELECT invoice_number AS "Invoice Number", invoice date AS "Date", 
invoice _total AS total 

FROM invoices 


A SELECT statement that omits the AS keyword 


SELECT invoice number "Invoice Number", invoice date "Date", 
invoice total total 
FROM invoices 


The result set for both SELECT statements 


Í) invoce Number || 
1 QP50872 25-FEB-08 116.54 


2 9545443 14-MAR-08 1083.58 


3 P-O608 11-APR-O8 20551.18 
4 P-0259 16-APR-08 26881 4 
5 MABO1489 16-APR-08 936.93 


A SELECT statement that doesn’t provide a name for a calculated column 


SELECT invoice number, invoice date, invoice total, 
invoice total - payment total - credit total 

FROM invoices 
J invoice NUMBER | Invoice_paTe | Invoice_ToTaL | INVOICE_TOTAL-PAYMENT_TOTAL-CREDIT_TOTAL 


1 QPSBB72 25-FEB-08 116.54 0 
2 0545443 14-MAR-08 1083.58 Qo 


3 P-0608 11-A4PR-08 20551.18 18351.18 
4 PF-0259 16-APR-08 26881 .4 D 
5 MABO1489 16-APR-08 936.93 D 


Description 


By default, a column in the result set is given the same name as the column in the base 
table. If that’s not what you want, you can specify a column alias for the column. 

One way to name a column is to use the AS keyword as shown in the first example 
above. Although the AS keyword is optional, it enhances readability. 

If an alias includes spaces or an Oracle reserved keyword, you must enclose it in double 
quotes. 

When you enclose an alias in quotes, the result set uses the capitalization specified by 
the alias. Otherwise, the result set capitalizes all letters in the column name. 


Figure 3-4 How to name the columns in a result set 
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How to code string expressions 


A string expression consists of a combination of one or more character 
columns and literal values. To combine, or concatenate, the columns and values, 
you use the concatenation operator (ll). This is illustrated by the examples in 
figure 3-5. 

The first example shows how to concatenate the vendor_city and 
vendor_state columns in the Vendors table. Notice that because no alias is 
assigned to this column, Oracle assigns a name, which is the entire expression. 
Also notice that the data in the vendor_state column appears immediately after 
the data in the vendor_city column in the results. That’s because of the way 
vendor_city is defined in the database. Because it’s defined as a variable-length 
column (the VARCHAR? data type), only the actual data in the column is in- 
cluded in the result. In contrast, if the column had been defined with a fixed 
length, any spaces after the name would have been included in the result. You’ ll 
learn about data types and how they affect the data in your result set in chapter 8. 

The second example shows how to format a string expression by adding 
spaces and punctuation. Here, the vendor_city column is concatenated with a 
string literal, or string constant, that contains a comma and a space. Then, the 
vendor_state column is concatenated with that result, followed by a string literal 
that contains a single space and the vendor_zip_code column. 

Occasionally, you may need to include a single quotation mark or an apostro- 
phe within a literal string. If you simply type a single quote, however, the system 
will misinterpret it as the end of the literal string. As a result, you must code two 
single quotation marks in a row. This is illustrated by the third example in this 
figure. 
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How to concatenate string data 


SELECT vendor city, vendor_state, vendor city || vendor state 
FROM vendors 


A VENDOR CITY |VENDOR_STATE | YENDOR_CITY|VENDOR_STATE 
1 Auburn Hills Auburn HllisMl 
2 Fresno Fresnoca, 


3 Olathe OlatheKS 
4 Fresno FresnocA 
5 East Brunswick East BrunsyickNd 


How to format string data using literal values 


SELECT vendor_name, 
vendor city || ', ' || vendor state || ' ' || 
vendor _zip code AS address 

FROM vendors 


A YENDOR NAME 4) ADDRESS 
1 Data Reproductlons Corp Auburn Hills, Ml 48326 
2 Executlye Offlos Products Fresno, C4, 93710 
3 Leslie Company Olathe, KS 66061 
4 Retirement Plan Consultants Fresno, CA 93704 
5 Simon Direct Inc East Brunswick, NJ 08816 


How to include apostrophes in literal values 


SELECT vendor_name || '''s address: ', 
vendor city || ', ' || vendor_state || ' ' || vendor_zip code 
FROM vendors 


A VENDOR_NAME||"S40DRESS:’ 4» VENDOR_CITY'| IVENDOR_STATE|"|¥ENDOR_ZIP_CODE 
1 Data Reproductlons Corp's address: Auburn Hills, Ml 48328 
2 Executlya Offices Products's address: Fresno, C4, 93710 


3 Leslie Company's address: Olathe, KS 66064 
4 Retirement Plan Consuttants's address: Fresno, CA 93704 
5 Simon Direct Inc's address: East Brunswick, NJ 08816 


Description 


A string expression can consist of one or more character columns, one or more literal 
values, or a combination of character columns and literal values. 


The columns specified in a string expression must contain string data (that means they’re 
defined with the CHAR or VARCHAR? data type). 

The literal values in a string expression also contain string data, so they can be called 
string literals or string constants. To create a literal value, enclose one or more charac- 
ters within single quotation marks (' ). 

You can use the concatenation operator (Il) to combine columns and literals in a string 
expression. 

You can include a single quote within a literal value by coding two single quotation 
marks, as shown in the third example above. 


Figure 3-5 How to code string expressions 
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How to code arithmetic expressions 


Figure 3-6 shows how to code arithmetic expressions. To start, it summa- 
rizes the four arithmetic operators you can use in this type of expression. Then, 
it presents two examples that show how to use these operators. 

The SELECT statement in the first example includes an arithmetic expres- 
sion that calculates the balance due for an invoice. This expression subtracts the 
payment_total and credit_total columns from the invoice_total column. The 
resulting column is given the name balance_due. 

When Oracle evaluates an arithmetic expression, it performs the operations 
from left to right based on the order of precedence. This order says that multi- 
plication and division are done first, followed by addition and subtraction. If 
that’s not what you want, you can use parentheses to specify how you want an 
expression evaluated. Then, the expressions in the innermost sets of parentheses 
are evaluated first, followed by the expressions in outer sets of parentheses. 
Within each set of parentheses, the expression is evaluated from left to right in 
the order of precedence. 

To illustrate how parentheses and the order of precedence affect the evalua- 
tion of an expression, consider the second example in this figure. Here, the 
expressions in the second and third columns both use the same operators. 
However, when Oracle evaluates the expression in the second column, it per- 
forms the multiplication operation before the addition operation because multi- 
plication comes before addition in the order of precedence. In contrast, when 
Oracle evaluates the expression in the third column, it performs the addition 
operation first because it’s enclosed in parentheses. As you can see in the result 
set, these two expressions result in different values. 

Unlike some other databases, Oracle doesn’t provide a modulo operator that 
can be used to return the remainder of a division operation. Instead, you must 
use the MOD function as described in the next figure. 
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The arithmetic operators in order of precedence 
Multiplication 


Division 
Addition 
Subtraction 


A SELECT statement that calculates the balance due 


SELECT invoice total, payment total, credit total, 
invoice_total - payment_total - credit_total AS balance due 
FROM invoices 


a INYOICE_TOTAL i N 
116.54 116.54 
1083.58 1083.58 
20551.18 D 
26881 .4 26881 4 
936.93 936.93 


A SELECT statement that uses parentheses to control the sequence of 
operations 


SELECT invoice id, 
invoice id + 7 * 3 AS order _of precedence, 
(invoice id + 7) * 3 AS add first 

FROM invoices 

ORDER BY invoice id 


a INVOICE_ID hi ORDER_OF_PRECEDENCE |H ADD_FIRST 


Description 


e Unless parentheses are used, the operations in an expression take place from left to right 
in the order of precedence. For arithmetic expressions, multiplication and division are 
done first, followed by addition and subtraction. 


e Whenever necessary, you can use parentheses to clarify or override the sequence of 
operations. Then, the operations in the innermost sets of parentheses are done first, 
followed by the operations in the next sets, and so on. 


Figure 3-6 How to code arithmetic expressions 
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How to use scalar functions 


Figure 3-7 introduces you to scalar functions, which operate on a single 
value and return a single value. These functions work differently than the 
aggregate functions described in chapter 5 that are used to summarize data. For 
now, don’t worry about the details of how these functions work, because you'll 
learn more about them in chapter 8. Instead, just focus on how they’re used in 
column specifications. 

To code a function, you begin by entering its name followed by a set of 
parentheses. If the function requires one or more parameters, you enter them 
within the parentheses and separate them with commas. When you enter a 
parameter, you need to be sure it has the correct data type. 

The first example in this figure shows how to use the SUBSTR function to 
extract the first character of the vendor_contact_first_name and 
vendor_contact_last_name columns. The first parameter of this function speci- 
fies the column name; the second parameter specifies the starting position; and 
the third parameter specifies the number of characters to return. The results of 
the two functions are then concatenated to form initials, as shown in the result 
set for this statement. 

The second example shows how to use the TO_CHAR function. This 
function converts a column with a DATE or NUMBER data type to a character 
string. A common use for it is in concatenation operations, where all the data 
being concatenated must be string data. This function has two parameters. The 
first parameter, which specifies the column name, is required. The second 
parameter, which specifies a format mask for the column, is optional. In this 
example, a format mask of ‘MM/DD/YYYY’ is used to convert the 
payment_date column from a DATE type to a CHAR type. This format mask 
specifies that the date should be displayed with a two-digit month, followed by 
a forward slash, followed by a two-digit day, followed by another forward slash, 
followed by a four-digit year. 

In the second example, the payment_date column for Invoice # P-0608 is 
NULL. Note that this causes the TO_CHAR function to return an empty string 
for the payment date. 

The third example uses the SYSDATE function to return the current date. 
Since this function doesn’t accept any parameters, you don’t need to code any 
parentheses after the name of the function. In fact, if you do code parentheses 
after the name of the function, you will get an error when you execute the 
statement. In this example, the second column uses the SYSDATE function to 
return the current date, and the third column uses the SYSDATE function to 
calculate the number of days between the two dates. Here, the third column also 
uses the ROUND function to round the decimal value that’s returned by the 
calculation to an integer. 

The fourth example shows how to use the MOD function to return the 
remainder of a division of two integers, Here, the second column contains an 
expression that returns the remainder of the division operation when the in- 
voice-id column is divided by 10. In the result set, you can see the results for 
IDs 9 through 11 (the remainders are 9, 0, and 1). 


Chapter3 How to retrieve data from a single table 93 


A SELECT statement that uses the SUBSTR function 


SELECT vendor contact first name, vendor contact last name, 
SUBSTR(vendor_contact_first_name, 1, 1) 
SUBSTR (vendor contact last name, 1, 1) AS initials 

FROM vendors 


p YENDOR_CONTACT_FIRST_NAME p YENDOR_CONTACT_LAST_NAME p INITIALS 
1 Cesar Arodondo 


2 Rachael Danielson 
3 Zey Alondra 
4 Salna Edgardo 
5 Daniel Bradisa 


A SELECT statement that uses the TO_CHAR function 


SELECT 'Invoice: # ' || invoice_number || ', dated ' || 
TO CHAR(payment date, 'MM/DD/YYYY') | | 
' for $' || TO CHAR(payment total) AS "Invoice Text" 


FROM invoices 


Invoice Text 
1 Involes: # GPS8872, dated 041 1/2006 for $116.54 
2 Involos: # 545443, dated 05 4/2006 tor $1083.58 
3 Involce: #P-0606, dated for $0 
4 Invoice: # P-0259, dated 054 2/2006 for $26881 4 
5 Invoice: # MA8O1489, dated 05M 3/2006 for $936.93 


A SELECT statement that uses the SYSDATE and ROUND functions 


SELECT invoice_date, 

SYSDATE AS today, 

ROUND (SYSDATE - invoice_date) AS invoice_age_in days 
FROM invoices 


D INVOICE_DATE J TODAY |i INVOICE_AGE_IN_ DAYS 
1 18-JUL-08 01-AUG-08 
2 20-JUN-08 01-AUG-08 
3 14-JUN-08 01-ALIG-08 


A SELECT statement that uses the MOD function 


SELECT invoice id, 

MOD (invoice id, 10) AS Remainder 
FROM invoices 
ORDER BY invoice _id 


Description 
e A SQL statement can include a function. A function performs an operation and returns a 
value. 


e For more information on using functions, see chapter 8. 


Figure 3-7 How to use scalar functions 
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How to use the Dual table 


The Dual table is automatically available to all users. This table is useful for 
testing expressions that use literal values, arithmetic calculations, and functions 
as shown in figure 3-8. In particular, the Dual table is often used in the docu- 
mentation that shows how Oracle’s built-in scalar functions work. 

In the example in this figure, the second column in the result set shows the 
value of the calculation 10 minus 7, and the third column shows the date that’s 
returned by the SYSDATE function. This shows that you can perform test 
calculations in more than one column of the Dual table. 
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A SELECT statement that uses the Dual table 


SELECT 'test' AS test_string, 
10-7 AS test_calculation, 
SYSDATE AS test date 

FROM Dual 


TEST_STRING | TEST_CALCULATION | TEST_DATE 
1 test 3 01-AUG-08 


Description 


e The Dual table is automatically created and made available to users. 


e The Dual table is useful for testing expressions that use literal values, arithmetic, opera- 
tors, and functions. 


Figure 3-8 How to use the Dual table 
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How to use the DISTINCT keyword to eliminate 
duplicate rows 


By default, all of the rows in the base table that satisfy the search condition 
in the WHERE clause are included in the result set. In some cases, though, that 
means that the result set will contain duplicate rows, or rows whose column 
values are identical. If that’s not what you want, you can include the DISTINCT 
keyword in the SELECT clause to eliminate the duplicate rows. 

Figure 3-9 illustrates how this works. Here, both SELECT statements 
retrieve the vendor_city and vendor_state columns from the Vendors table, The 
first statement, however, doesn’t include the DISTINCT keyword. Because of 
that, the same city and state can appear in the result set multiple times. In the 
results shown in this figure, for example, you can see that “Anaheim CA” 
occurs twice. In contrast, the second statement includes the DISTINCT key- 
word, so each city/state combination is included only once. 
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A SELECT statement that returns all rows 


SELECT vendor _ city, vendor state 
FROM vendors 
ORDER BY vendor city 


j VENDOR_cITY | VENDOR_STATE 
1 Anaheim 
2 Anahelm 


3 Ann Arbor 
4 Auburn Hills 
5 Boston 


(122 rows selected) 


A SELECT statement that eliminates duplicate rows 


SELECT DISTINCT vendor_city, vendor_state 
FROM vendors 
ORDER BY vendor city 


J VENDOR CITY |$ VENDOR_STATE 
1 ånahelm 
2 Ann Arbor 


3 Auburn Hills 
4 Boston 
5 Brea 


(53 rows selected) 


Description 


e The DISTINCT keyword prevents duplicate (identical) rows from being included in the 
result set. 


e The ALL keyword causes all rows matching the search condition to be included in the 
result set, regardless of whether rows are duplicated. Since this is the default, it’s a 
common practice to omit the ALL keyword. 


e To use the DISTINCT or ALL keyword, code it immediately after the SELECT keyword. 


Figure 3-9 How to use the DISTINCT keyword to eliminate duplicate rows 
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How to use the ROWNUM pseudo column to limit 
the number of rows 


In addition to eliminating duplicate rows, you can limit the number of rows 
that are retrieved by a SELECT statement. To do that, you can use the 
ROWNUM pseudo column as shown in figure 3-10. A pseudo column works 
similarly to a column in a table. However, you can only use a pseudo column to 
select data. In other words, you can’t insert, update, or delete the values stored 
in a pseudo column. 

If you want to learn more about how pseudo columns work, you can search 
the Oracle Database SQL Reference manual for pseudocolumn. Note that the 
Oracle documentation doesn’t include a space between the words pseudo and 
column. 

The first example shows how to limit the number of rows in the result set to 
the first five rows. Here, the ROWNUM pseudo column is used in the WHERE 
clause to return the first five rows in the Invoices table. 

The second example shows how to add an ORDER BY clause to sort the 
first five rows of the table so the largest invoice total is displayed first. Since the 
sort operation is applied after the first five rows are retrieved, this doesn’t 
retrieve the five largest invoice totals in the Invoices table. Instead, it returns the 
first five rows of the table and then sorts them. 

If you want to return the five largest invoice totals for the entire Invoices 
table, you need to sort the result set before you use the ROWNUM pseudo 
column to limit the number of rows included in the result set. To do that, you 
can use a subquery as shown in the third example. In the FROM clause, this 
example supplies a SELECT statement that sorts the Invoices table instead of 
supplying the name of the Invoices table. As a result, the table is sorted before 
the WHERE clause is applied. 

For more information about working with subqueries, see chapter 6. In 
addition, if the ROWNUM pseudo column isn’t adequate for your needs, you 
might want to use the ROW_NUMBER function described in chapter 8. 
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A SELECT statement that uses the ROWNUM pseudo column to limit the 
number of rows in the result set 


SELECT vendor id, invoice total 
FROM invoices 
WHERE ROWNUM <= 5 


j VENDOR JD |É INVOICE_TOTAL 
116.54 
1083.58 


20551.18 
265881.4 
936.93 


A SELECT statement that sorts the result set after the WHERE clause 


SELECT vendor_id, invoice_total 
FROM invoices 

WHERE ROWNUM <= 5 

ORDER BY invoice_total DESC 


VENDOR JD |È INVOICE_TOTAL 
268814 
20551.18 
1083.58 
938.93 
116.54 


A SELECT statement that sorts the result set before the WHERE clause 


SELECT vendor id, invoice total 
FROM (SELECT * FROM invoices 
ORDER BY invoice total DESC) 
WHERE ROWNUM <= 5 
J YENDOR_ID | INVOICE_TOTAL 
37988.19 
288814 


23517.58 
21842 
20551.18 


Description 


e ‘You can use the ROWNUM pseudo column to limit the number of rows included in the 
result set. 


e Ifyou want to sort the result set before you use the ROWNUM pseudo column to limit 
the number of rows included in the result set, you can use a subquery as shown in the 
third example. For more information about working with subqueries, see chapter 6. 


Figure 3-10 | How to use the ROWNUM pseudo column 
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How to code the WHERE clause 


The WHERE clause in a SELECT statement filters the rows in the base 
table so that only the rows you need are retrieved. In the topics that follow, 
you'll learn a variety of ways to code this clause. 


How to use the comparison operators 


Figure 3-11 shows you how to use the comparison operators in the search 
condition of a WHERE clause. As you can see in the syntax summary at the top 
of this figure, you use a comparison operator to compare two expressions. If the 
result of the comparison is True, the row being tested is included in the query 
results. 

The examples in this figure show how to use some of the comparison 
operators. The first WHERE clause, for example, uses the equal operator (=) to 
retrieve only those rows whose vendor_state column have a value of IA. Since 
the state code is a string literal, it must be enclosed in single quotes. 

In contrast, a numeric literal like the one in the second WHERE clause isn’t 
enclosed in quotes. This clause uses the greater than (>) operator to retrieve 
only those rows that have a balance due greater than zero. 

The third WHERE clause illustrates another way to retrieve all the invoices 
with a balance due. Like the second clause, it uses the greater than operator. 
Instead of comparing the balance due to a value of zero, however, it compares 
the invoice total to the total of the payments and credits that have been applied 
to the invoice. 

The fourth WHERE clause illustrates how you can use comparison opera- 
tors other than the equal operator with string data. In this example, the less than 
operator (<) is used to compare the value of the vendor_name column to a literal 
string that has the letter M in the first position. That will cause the query to 
return all vendors with names that begin with the letters A through L. 

You can also use the comparison operators with date literals, as illustrated 
by the fifth and sixth WHERE clauses. The fifth clause will retrieve rows with 
invoice dates on or before May 31, 2008, and the sixth clause will retrieve rows 
with invoice dates on or after May 1, 2008. Like string literals, date literals must 
be enclosed in single quotes. In addition, you can use different formats to 
specify dates, as shown by the two date literals shown in this figure. You’ ll learn 
more about the acceptable date formats and date comparisons in chapter 8. 

The last WHERE clause shows how you can test for a not equal condition. 
To do that, you code a less than sign followed by a greater than sign. In this 
case, only rows with a credit total that’s not equal to zero will be retrieved. 

Whenever possible, you should compare expressions that have similar data 
types. If you attempt to compare expressions that have different data types, 
Oracle may implicitly convert the data types for you. Although implicit conver- 
sions are often acceptable, they will occasionally yield unexpected results. In 
chapter 8, you’ll learn how to explicitly convert data types so your comparisons 
will always yield the results that you want. 
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The syntax of the WHERE clause with comparison operators 


WHERE expression 1 operator expression 2 


The comparison operators 


Equal 
Greater than 
Less than 


Less than or equal to 
Greater than or equal to 
Not equal 


Examples of WHERE clauses that retrieve... 
Vendors located in lowa 
WHERE vendor state = 'IA' 


Invoices with a balance due (two variations) 
WHERE invoice total - payment _total - credit _total > 0 
WHERE invoice total > payment_total + credit total 


Vendors with names from A to L 
WHERE vendor_name < 'M' 


Invoices on or before a specified date 
WHERE invoice date <= '31-MAY-08' 


Invoices on or after a specified date 
WHERE invoice date >= '01-MAY-08' 


Invoices with credits that don’t equal zero 
WHERE credit_total <> 0 


Description 


e ‘You can use a comparison operator to compare any two expressions that result in like 
data types. Although unlike data types may be converted to data types that can be 
compared, the comparison may produce unexpected results. 


e Ifthe result of a comparison results in a True value, the row being tested is included in 
the result set. If it’s False or Unknown, the row isn’t included. 


e To use a string literal or a date literal in a comparison, enclose it in quotes. To use a 
numeric literal, enter the number without quotes. 


e Character comparisons are case-sensitive. ‘CA’ and ‘Ca’, for example, are not equivalent. 


Figure 3-11 How to use the comparison operators 
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How to use the AND, OR, and NOT logical 
operators 


Figure 3-12 shows how to use logical operators in a WHERE clause. You 
can use the AND and OR operators to combine two or more search conditions 
into a compound condition. And you can use the NOT operator to negate a 
search condition. The examples in this figure illustrate how these operators 
work. 

The first two examples illustrate the difference between the AND and OR 
operators, When you use the AND operator, both conditions must be true, So in 
the first example, only those vendors located in Springfield, New Jersey, are 
retrieved from the Vendors table. When you use the OR operator, though, only 
one of the conditions must be true. So in the second example, all vendors 
located in New Jersey and all the vendors located in Pittsburgh are retrieved. 

The third example shows a compound condition that uses two NOT opera- 
tors. As you can see, this expression is difficult to understand. Because of that, 
you should avoid using this operator. The fourth example in this figure, for 
instance, shows how the search condition in the third example can be rephrased 
to eliminate the NOT operator. As a result, the condition in the fourth example 
is much easier to understand. 

The last two examples in this figure show how the order of precedence for 
the logical operators and the use of parentheses affect the result of a search 
condition. By default, the NOT operator is evaluated first, followed by AND and 
then OR. However, you can use parentheses to override the order of precedence 
or to clarify a logical expression, just as you can with arithmetic expressions. In 
the next to last example, for instance, no parentheses are used, so the two 
conditions connected by the AND operator are evaluated first. In the last ex- 
ample, though, parentheses are used so the two conditions connected by the OR 
operator are evaluated first. 
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The syntax of the WHERE clause with logical operators 


WHERE [NOT] search condition 1 {AND|OR} [NOT] search condition 2 ... 


Examples of queries using logical operators 


A search condition that uses the AND operator 
WHERE vendor state = 'NJ' AND vendor city = 'Springfield' 


A search condition that uses the OR operator 
WHERE vender _state = 'NJ' OR vendor city = 'Pittsburgh' 


A search condition that uses the NOT operator 
WHERE NOT (invoice _total >= 5000 OR NOT invoice date <= '01-JUL-2008') 


The same condition rephrased to eliminate the NOT operator 
WHERE invoice total < 5000 AND invoice date <= '01-JUL-2008' 


A compound condition without parentheses 


SELECT invoice number, invoice date, invoice total 

FROM invoices 

WHERE invoice date > '01-MAY-2008' OR invoice total > 500 
AND invoice total - payment total - credit _ total > 0 
ORDER BY invoice_number 


@ invoice_NUMBER |) invoice_pate | invoice_toTaL 
1 0-2058 O8-MAY-O8 37966.19 
2 0-2060 O8-MAY-08 23517.58 
3 0-2438 OF-MAY-08 10376.08 


(91 rows selected) 


The same compound condition with parentheses 


WHERE (invoice date > '01-MAY-2008' OR invoice _total > 500) 
AND invoice total - payment total - credit total > 0 
ORDER BY invoice_number 


INVOICE_NUMBER |È INvoIcE_DATE | INVoICE_ToTAL 
1 0-2438 OF-MaYy-08 10976.06 
2 109596 4 d-JLIN-O8 418 


3 111-92R-10092 O4-JUN-08 46.21 


(39 rows selected) 


Description 


You can use the AND and OR logical operators to create compound conditions that 
consist of two or more conditions. You use the AND operator to specify that the search 
must satisfy both of the conditions, and you use the OR operator to specify that the 
search must satisfy at least one of the conditions. 


You can use the NOT operator to negate a condition, but that can make the search condi- 
tion difficult to understand. If it does, you should rephrase the condition to eliminate NOT. 
When Oracle evaluates a compound condition, it evaluates the operators in this se- 
quence: (1) NOT, (2) AND, and (3) OR. You can use parentheses to override this order of 
precedence or to clarify the sequence in which the operations will be evaluated. 


Figure 3-12 How to use the AND, OR, and NOT logical operators 
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How to use the IN operator 


Figure 3-13 shows how to code a WHERE clause that uses the IN operator. 
When you use this operator, the value of the test expression is compared with 
the list of expressions in the IN phrase. If the test expression is equal to one of 
the expressions in the list, the row is included in the query results. This is 
illustrated by the first example in this figure, which will return all rows whose 
terms_id column is equal to 1, 3, or 4. 

You can also use the NOT operator with the IN phrase to test for a value 
that’s not in a list of expressions. This is illustrated by the second example in 
this figure. In this case, only those vendors who are not in California, Nevada, 
or Oregon are retrieved. 

If you look at the syntax of the IN phrase shown at the top of this figure, 
you'll see that you can code a subquery in place of a list of expressions. 
Subqueries are a powerful tool that you’ll learn about in chapter 6. For now, 
though, you should know that a subquery is simply a SELECT statement within 
another statement. In the third example in this figure, for instance, a subquery is 
used to return a list of vendor_id values for vendors who have invoices dated 
May 1, 2008. Then, the WHERE clause retrieves a vendor row only if the 
vendor is in that list. Note that for this to work, the subquery must return a 
single column, in this case, vendor_id. 
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The syntax of the WHERE clause with the IN operator 


WHERE test expression [NOT] IN ({subquery|expression 1 [, expression 2]...}) 


Examples of the IN operator 
The IN operator with a list of numeric literals 
WHERE terms id IN (1, 3, 4) 
The IN operator preceded by NOT 
WHERE vendor state NOT IN ('CA', 'NV', 'OR') 


The IN operator with a subquery 


WHERE vendor id IN 
(SELECT vendor_id 
FROM invoices 
WHERE invoice date = '01-MAY-2008') 


Description 


e ‘You can use the IN operator to test whether an expression is equal to a value in a list of 
expressions. Each of the expressions in the list must evaluate to the same type of data as 
the test expression. 


e The list of expressions can be coded in any order without affecting the order of the rows 
in the result set. 

e ‘You can use the NOT operator to test for an expression that’s not in the list of expres- 
sions. 

e ‘You can also compare the test expression to the items in a list returned by a subquery as 
illustrated by the third example above. You’ll learn more about coding subqueries in 
chapter 6. 


Figure 3-13 How to use the IN operator 
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How to use the BETWEEN operator 


Figure 3-14 shows how to use the BETWEEN operator in a WHERE clause. 
When you use this operator, the value of a test expression is compared to the 
range of values specified in the BETWEEN phrase. If the value falls within this 
range, the row is included in the query results. 

The first example in this figure shows a simple WHERE clause that uses the 
BETWEEN operator. It retrieves invoices with invoice dates between May 1, 
2008 and May 31, 2008. Note that the range is inclusive, so invoices with 
invoice dates of May 1 and May 31 are included in the results. 

The second example shows how to use the NOT operator to select rows that 
are not within a given range. In this case, vendors with zip codes that aren’t 
between 93600 and 93799 are included in the results. 

The third example shows how you can use a calculated value in the test 
expression. Here, the payment_total and credit_total columns are subtracted 
from the invoice_total column to give the balance due. Then, this value is 
compared to the range specified in the BETWEEN phrase. 

The last example shows how you can use calculated values in the BE- 
TWEEN phrase. Here, the first value selects the function SYSDATE (which 
represents the current date), and the second value is SYSDATE plus 30 days. So 
the query results will include all those invoices that are due between the current 
date and 30 days from the current date. 

However, please note the warning about date comparisons in this figure. In 
particular, an invoice-date of May 31, 2008 at 2:00 PM isn’t less than or equal 
to “31-May-2008”, and it isn’t between “01-May-2008” and “31-May-2008”. 
To learn more about date comparisons, please read chapter 8. 
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The syntax of the WHERE clause with the BETWEEN operator 


WHERE test expression [NOT] BETWEEN begin expression AND end expression 


Examples of the BETWEEN operator 


The BETWEEN operator with literal values 
WHERE invoice date BETWEEN 'O1-MAY-2008' AND '31-MAY-=-2008' 


The BETWEEN operator preceded by NOT 
WHERE vendor_zip code NOT BETWEEN 93600 AND 93799 


The BETWEEN operator with a test expression coded as a calculated value 
WHERE invoice_total - payment_total - credit_total BETWEEN 200 AND 500 
The BETWEEN operator with the upper and lower limits coded as calculated 
values 

WHERE invoice_due date BETWEEN SYSDATE AND (SYSDATE + 30) 


Description 


You can use the BETWEEN operator to test whether an expression falls within a range 
of values. The lower limit must be coded as the first expression, and the upper limit must 
be coded as the second expression. Otherwise, the result set will be empty. 

The two expressions used in the BETWEEN operator for the range of values are inclu- 
sive. That is, the result set will include values that are equal to the upper or lower limit. 


You can use the NOT operator to test for an expression that’s not within the given range. 


Warning about date comparisons 


All columns that have the DATE data type include both a date and time, and so does the 
value returned by the SYSDATE function. But when you code a date literal like “31- 
May-2008”, the time defaults to 00:00:00 on a 24-hour clock, or 12:00 AM (midnight). 
As aresult, a date comparison may not yield the results you expect. For instance, May 
31, 2008 at 2:00 PM isn’t between “01-May-2008” and “31-May-2008”. 


To learn more about date comparisons, please see chapter 8. 


Figure 3-14 How to use the BETWEEN operator 
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How to use the LIKE operator 


One final operator you can use in a search condition is the LIKE operator, 
shown in figure 3-15. You use this operator along with the wildcards shown at 
the top of this figure to specify the string pattern, or mask, that you want to 
match. The examples in this figure show how this works. 

In the first example, the LIKE phrase specifies that all vendors in cities that 
start with the letters SAN should be included in the query results. Here, the 
percent sign (%) indicates that any characters can follow these three letters. So 
San Diego and Santa Ana are both included in the results. 

The second example selects all vendors whose vendor name starts with the 
letters COMPU, followed by any one character, the letters ER, and any charac- 
ters after that. Two vendor names that match that pattern are Compuserve and 
Computerworld. 

The LIKE operator provides a powerful technique for finding information in 
a database that can’t be found using any other technique. 


Chapter3 How to retrieve data from a single table 
The syntax of the WHERE clause with the LIKE operator 
WHERE match expression [NOT] LIKE pattern 


Wildcard symbols 


Symbol Description 


% Matches any string of zero or more characters. 


=i Matches any single character. 


WHERE clauses that use the LIKE operator 
Example Results that match the mask 
WHERE vendor city LIKE 'SAN%' “San Diego” and “Santa Ana” 


WHERE vendor name LIKE 'COMPU_ER%' “Compuserve” and “Computerworld” 


Description 


e You use the LIKE operator to retrieve rows that match a string pattern, called a mask. 
Within the mask, you can use special characters, called wildcard characters, that deter- 
mine which values in the column satisfy the condition. 


e You can use the NOT operator before the LIKE operator. Then, only those rows with 
values that don’t match the string pattern will be included in the result set. 


Figure 3-15 How to use the LIKE operator 
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How to use the IS NULL condition 


In chapter 1, you learned that a column can contain a null value. A null isn’t 
the same as zero, a blank string that contains one or more spaces (‘ °), or an 
empty string that doesn’t contain any spaces (‘’). Instead, a null value indicates 
that the data is not applicable, not available, or unknown. When you allow null 
values in one or more columns, you need to know how to test for them in search 
conditions. To do that, you can use the IS NULL condition, as shown in figure 3-16. 

This figure uses a table named null_sample to illustrate how to search for 
null values. This table contains two columns. The first column, invoice_id, is an 
identification column. The second column, invoice_total, contains the total for 
the invoice, which can be a null value. As you can see in the first example, the 
invoice with an invoice_id of 3 contains a null value. 

The second example in this figure shows what happens when you retrieve 
all the invoices with invoice totals equal to zero. Notice that the row with a null 
invoice total isn’t included in the result set. Likewise, it isn’t included in the 
result set that contains all the invoices with invoices totals that aren’t equal to 
zero, as illustrated by the third example. Instead, you have to use the IS NULL 
condition to retrieve rows with null values, as shown in the fourth example. 

You can also use the NOT operator with the IS NULL condition as illus- 
trated in the last example in this figure. When you use this operator, all of the 
rows that don’t contain null values are included in the query results. 
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The syntax of the WHERE clause with the IS NULL condition 
WHERE expression IS [NOT] NULL 


The contents of the Null_Sample table 


SELECT * 
FROM null_sample 


D invoicetp I invoice_toTat 
125 
Q 


nulh 
2199.99 
0 


A SELECT statement that retrieves rows with zero values 


SELECT * 
FROM null_sample 
WHERE invoice total = 0 


INVOICE mg INYORE TOTAL 
1 2 it] 
2 5 0 


rig 


A SELECT statement that retrieves rows with non-zero values 


SELECT * 
FROM null sample 
WHERE invoice total <> 0 


W INVOICE JD | INVOICE_TOTAL 
1 


1 125 
2 4 2199.99 


rim 


A SELECT statement that retrieves rows with null values 


SELECT * 
FROM null sample 
WHERE invoice _total IS NULL 


W Invoice jp | INYOICE_TOTAL 
1 3 (null) 


anr 


A SELECT statement that retrieves rows without null values 


SELECT * 
FROM null_sample 
WHERE invoice_total IS NOT NULL 


D invoice_o |B invoice_ToTaL 
125 
Q 


2199.99 
0 


Figure 3-16 How to use the IS NULL condition 
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How to code the ORDER BY clause 


The ORDER BY clause specifies the sort order for the rows in a result set. 
In most cases, you can use column names from the base table to specify the sort 
order as you saw in some of the examples earlier in this chapter. However, you 
can also use other techniques to sort the rows in a result set, as described in the 
topics that follow. 


How to sort a result set by a column name 


Figure 3-17 presents the expanded syntax of the ORDER BY clause. As you 
can see, you can sort by one or more expressions in either ascending or de- 
scending sequence. This is illustrated by the three examples in this figure. 

The first two examples show how to sort the rows in a result set by a single 
column. In the first example, the rows in the vendors table are sorted in ascend- 
ing sequence by the vendor_name column. Since ascending is the default 
sequence, the ASC keyword is omitted. In the second example, the rows are 
sorted by the vendor_name column in descending sequence. 

To sort by more then one column, you simply list the names in the ORDER 
BY clause separated by commas as shown in the third example. Here, the rows 
in the Vendors table are first sorted by the vendor_state column in ascending 
sequence. Then, within each state, the rows are sorted by the vendor_city 
column in ascending sequence. Finally, within each city, the rows are sorted by 
the vendor_name column in ascending sequence. This can be referred to as a 
nested sort because one sort is nested within another. 

Although all of the columns in this example are sorted in ascending se- 
quence, you should know that doesn’t have to be the case. For example, we 
could have sorted by the vendor_name column in descending sequence like this: 

ORDER BY vendor_state, vendor_city, vendor_name DESC 


Note that the DESC keyword in this example applies only to the vendor_name 
column. The vendor_state and vendor_city columns are still sorted in ascending 
sequence. 

If you study the first example in this figure, you can see that capital letters 
come before lowercase letters in an ascending sort. As a result, “ASC Signs” 
comes before “Abbey Office Furnishings” in the result set. For some business 
applications, this is acceptable. But if it isn’t, you can use the LOWER function 
to convert the column to lowercase letters in the ORDER BY clause like this: 

ORDER BY LOWER (vendor name) 


Then, the rows will be sorted in the correct alphabetical sequence. In chapter 8, 
you can learn more about this function. 
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The expanded syntax of the ORDER BY clause 


ORDER BY expression [ASC|DESC] [, expression [ASC|DESC]] ... 


An ORDER BY clause that sorts by one column in ascending sequence 


SELECT vendor name, 

vendor city || ', ' || vendor_state || ' ' || vender_zip cede AS address 
FROM vendors 
ORDER BY vendor_name 


VENDOR NAME 
1 ASC Signs Frasno, Cå 93703 


2 AT&T Phoenix, 4Z 85062 
3 Abbey Office Furnishings Fresno, Cå 93722 


An ORDER BY clause that sorts by one column in descending sequence 


SELECT vendor name, 

vendor_city || ', ' || vendor_state || ' ' || vendor_zip cede AS address 
FROM vendors 
ORDER BY vendor_name DESC 


BR YENDOR_NAME a ADDRESS 
4 Zylka Design Fresno, CA 93711 


2 Zip Print & Copy Center Fresno, CA, 93777 
3 Zee Medical Service Co Washington, lå, 52353 


An ORDER BY clause that sorts by three columns 


SELECT vendor_name, 

vender city || ', ' || vendor_state || ' ' || vendor_zip code AS address 
FROM vendors 
ORDER BY vendor state, vendor city, vendor _name 


H 6VENDOR_NAME ADDRESS 
1 AT&T Phoenix, AZ 65062 
2 Computer Library Phoenix, 42 65023 
3 Wells Fargo Bank Phoenix, 4Z 85038 


4 Aztek Label Anaheim, Cå 92607 
5 Blue Shield of California Anaheim, CA, 92650 
6 Diversified Printing & Pub Brea, CA 92621 

7 ASC Signs Fresno, CA 93703 


Description 


e The ORDER BY clause specifies how you want the rows in the result set sorted. You can 
sort by one or more columns, and you can sort each column in either ascending (ASC) or 
descending (DESC) sequence. ASC is the default. 

e By default, in an ascending sort, special characters appear first in the sort sequence, 
followed by numbers, then by capital letters, then by lowercase letters, and then by null 
values. In a descending sort, this sequence is reversed. 

e With one exception, you can sort by any column in the base table, regardless of whether 
it’s included in the SELECT clause. The exception is if the query includes the DISTINCT 
keyword. Then, you can only sort by columns included in the SELECT clause. 


Figure 3-17 How to sort a result set by a column name 
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How to sort a result set by an alias, an expression, 
or a column number 


Figure 3-18 presents three more techniques that you can use to specify sort 
columns. First, you can use a column alias that’s defined in the SELECT clause. 
The first SELECT statement in this figure, for example, sorts by a column named 
address, which is an alias for the concatenation of the vendor_city, vendor_state, 
and vendor_zip_code columns. Within the address column, the result set is sorted 
by the vendor_name column. 

You can also use an arithmetic or string expression in the ORDER BY clause, 
as illustrated by the second example in this figure. Here, the expression consists 
of the vendor_contact_last_name column concatenated with the 
vendor_contact_first_name column. Here, neither of these columns is included in 
the SELECT clause. 

The last example in this figure shows how you can use column numbers to 
specify a sort order. To use this technique, you code the number that corresponds 
to the column of the result set, where 1 is the first column, 2 is the second 
column, and so on. In this example, the ORDER BY clause sorts the result set by 
the second column, which contains the concatenated address, then by the first 
column, which contains the vendor name. The result set returned by this state- 
ment is the same as the result set returned by the first statement. 

Notice, however, that the statement that uses column numbers is more 
difficult to read because you have to look at the SELECT clause to see what 
columns the numbers refer to. In addition, if you add or remove columns from 
the SELECT clause, you may also have to change the ORDER BY clause to 
reflect the new column positions. As a result, we don’t recommend this coding 
technique. 
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An ORDER BY clause that uses an alias 


SELECT vendor name, 

vendor city || ', ' || vendor_state || ' ' || vendor_zip code AS address 
FROM vendors 
ORDER BY address, vendor_name 


YENDOR NAME $ ADDRESS 
1 Aztek Label Anahelm, C4 92607 
2 Blue Shield of California Anahelm, C4 92650 


3 Malloy Lithographing Inc Ann Arbor, Ml 48106 
4 Data Reproductions Corp Auburn Hills, Ml 46326 


An ORDER BY clause that uses an expression 


SELECT vendor name, 


vendor city || ', ' || vendor state || ' ' || vendor zip code AS address 
FROM vendors 
ORDER BY vendor_contact_last_name || vendor_contact_first_name 


@  VENDOR_NAME 
1 Dristas Groom & McCormick Fresno, CA 93720 
2 Internal Revenue Service Fresno, CA 93666 
3 US Postal Service Madison, W 53707 
4 Yale Industrial Trucks-Fresno Fresno, CA 93706 


An ORDER BY clause that uses column positions 


SELECT vendor name, 

vendor city || ', ' || vendor_state || ' ' || vendor_zip code AS address 
FROM vendors 
ORDER BY 2, 1 


8 YENDOR MåME 4 ADDRESS 
1 Aztek Label Anahalm, Cå, 92607 
2 Blue Shield of California Anahalm, Cå, 92650 


3 Malloy Lithographing Inc Ann Arbor, Ml 46106 
4 Data Reproductions Corp Auburn Hills, Ml 48326 


Description 


The ORDER BY clause can include a column alias that’s specified in the SELECT 
clause. 

The ORDER BY clause can include any valid expression. The expression can refer to 
any column in the base table, even if it isn’t included in the result set. 

The ORDER BY clause can use numbers to specify the columns to use for sorting. In 
that case, 1 represents the first column in the result set, 2 represents the second column, 
and so on. 


Figure 3-18 How to sort a result set by an alias, an expression, or a column number 
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Perspective 


The goal of this chapter has been to teach you the basic skills for coding 
SELECT statements. You’ll use these skills in almost every SELECT statement 
you code. As you’ll see in the chapters that follow, however, there’s a lot more 
to coding SELECT statements than what’s presented here. In the next three 
chapters, then, you’ll learn additional skills for coding SELECT statements. 


Terms 


base table 

keyword 

filter 

Boolean expression 
predicate 

expression 

column alias 

string expression 
concatenate 
concatenation operator 
literal value 

string literal 

string constant 
arithmetic expression 
arithmetic operator 


Exercises 


order of precedence 
function 

parameter 

date literal 
comparison operator 
logical operator 
compound condition 
pseudo column 
subquery 

string pattern 

mask 

wildcard 

null value 

nested sort 


Run some of the examples in this chapter 
In these exercises, you’ll use Oracle SQL Developer to run some of the scripts 
for the examples in this chapter. This assumes that you already know how to use 


SQL Developer, as described in chapter 2. 
1. Start Oracle SQL Developer. 


2. Open the script for fig3-02a that you should find in this directory: 
c:\murach\oracle_sql\scripts\ch03. Then, press the F9 key or click on the 
Execute Statement button to run the script. This shows you the data that’s in the 
Invoices table that you’ll be working with in this chapter. 


3. Open and run the script for fig3-02b. 


Open and run the scripts for any of the other examples in this chapter that 


you’re interested in reviewing. 
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Enter and run your own SELECT statements 


In these exercises, you’ll enter and run your own SELECT statements. To do that, 
you can open the script for an example that is similar to the statement you need 
to write, copy the statement into a new Worksheet window, and then modify the 
statement. That can save you both time and syntax errors. 


5. 


Write a SELECT statement that returns three columns from the Vendors table: 
vendor_name, vendor_contact_last_name, and vendor_contact_first_name. 
Then, run this statement. 


Next, add code to this statement so it sorts the result set by last name and then 
first name. Then, run this statement again. This is a good way to build and test a 
statement, one clause at a time. 


Write a SELECT statement that returns one column from the Vendors table 
named full_name. Create this column from the vendor_contact_first_name and 
vendor_contact_last_name columns, and format it like this: last name, comma, 
space, first name (for example, “Doe, John”). Next, sort the result set by last 
name and then first name. Then, filter the result set for contacts whose last name 
begins with the letter A, B, C, or E. 


Write a SELECT statement that returns four columns from the Invoices table 
named Due Date, Invoice Total, 10%, and Plus 10%. These columns should 
contain this data: 


Due Date The invoice_due_date column 
Invoice Total The invoice_total column 

10% 10% of the value of invoice_total 
Plus 10% The value of invoice_total plus 10% 


(For example, if invoice_total is 100, 10% is 10, and Plus 10% is 110.) Next, 
filter the result set so it returns only those rows with an invoice total that’s 
greater than or equal to 500 and less than or equal to 1000. Then, sort the result 
set in descending sequence by invoice_due_date. 


Write and run a SELECT statement that returns four columns from the Invoices 
table named Number, Total, Credits, and Balance Due. These columns should 
include this data: 


Number The invoice_number column 
Total The invoice_total column 
Credits Sum of the payment_total and credit_total columns 


Balance Due _Invoice_total minus the sum of payment_total and credit_total 


Next, filter for invoices with a balance due that’s greater than or equal to $500. 
Then, sort the result set by balance due in descending sequence. Last, use the 
ROWNUM pseudo column so the result set contains only the rows with the 10 
largest balance dues. 
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Work with nulls and use the Dual table 


9. 


10. 


Write a SELECT statement that returns the balance due and the payment date 
from the Invoices table, but only when the payment_date column contains a 
null value. Then, modify the WHERE clause so it returns any invalid rows 
(rows in which the balance due is zero and the payment date is null). 

Use the Dual table to create a row with these columns: 

Starting Principal Starting principle which should be equal to $51,000 
New Principal Starting principal plus a 10% increase 

Interest 6.5% of the new principal 


Principal + Interest The new principal plus the interest (add the expression 
you used for the new principal calculation to the 
expression you used for the interest calculation) 

Now, add a column named “System Date” that uses the TO_CHAR function to 

show the results of the SYSDATE function when it’s displayed with this 

format: 

'dd-mon-yyyy hh24:mi:ss' 

This format will display the day, month, year, hours, minutes, and seconds of 

the system date, and this will show that the system date also includes a time. 

(You should be able to figure out how to use the TO_CHAR and SYSDATE 

functions by studying figure 3-7.) 
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How to retrieve data from 
two or more tables 


In the last chapter, you learned how to create result sets that contain data from a 
single table. Now, this chapter will show you how to create result sets that 
contain data from two or more tables. To do that, you can use either a join or a 
union. 
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How to work with inner joins 


A join lets you combine columns from two or more tables into a single 
result set. In the topics that follow, you’ll learn how to use the most common 
type of join, an inner join. You'll learn how to use other types of joins later in 
this chapter. 


How to code an inner join 


Figure 4-1 presents the explicit syntax for coding an inner join. This syntax 
was introduced in version 9i of Oracle. As you’ll see later in this chapter, Oracle 
also provides an implicit syntax that you can use to code inner joins. However, 
the syntax shown in this figure is the one you’ll use most often. 

To join data from two tables, you code the names of the two tables in the 
FROM clause along with the JOIN keyword and an ON phrase that specifies the 
join condition. The join condition indicates how the two tables should be 
compared. In most cases, they’re compared based on the relationship between 
the primary key of the first table and a foreign key of the second table. 

The SELECT statement in this figure, for example, joins data from the 
Vendors and Invoices tables based on the vendor_id column in each table. 
Notice that because the equal operator is used in this condition, the value of the 
vendor_id column in a row in the Vendors table must match the vendor_id in a 
row in the Invoices table for that row to be included in the result set. In other 
words, only vendors with one or more invoices will be included. Although 
you’ ll code most inner joins using the equal operator, you should know that you 
can compare two tables based on other conditions too. 

In this example, the Vendors table is joined with the Invoices table using a 
column that has the same name in both tables: vendor_id. Because of that, the 
columns must be qualified to indicate which table they come from. As you can 
see, you code a qualified column name by entering the table name and a period 
in front of the column name. Although this example uses qualified column 
names only in the join condition, you must qualify a column name anywhere it 
appears in the statement if the same name occurs in both tables. If you don’t, 
Oracle will return an error indicating that the column name is ambiguous. Of 
course, you can also qualify column names that aren’t ambiguous. However, we 
recommend you do that only if it clarifies your code. 
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The explicit syntax for an inner join 


SELECT select list 
FROM table_1 
[INNER] JOIN table 2 
ON join_condition_1 
[ [INNER] JOIN table 3 
ON join_condition_2]... 


A SELECT statement that joins the Vendors and Invoices tables 


SELECT invoice number, vendor_name 
FROM vendors INNER JOIN invoices 

ON vendors.vendor_id = invoices.vendor_id 
ORDER BY invoice number 


The result set 
@  INVOICE_NUMBER |) VENDOR NAME 
1 0-2056 Malloy Lithagraphing Inc a 
2 0-2060 Malloy Lthographing Inc i 
| 


3 0-2436 Malloy Lithagraphing Inc 


4 4-200-5164 Fadaral Express Corporation 


(114 rows selected) 


Description 


A join is used to combine columns from two or more tables into a result set based on the 
join conditions you specify. For an inner join, only those rows that satisfy the join 
condition are included in the result set. 


A join condition names a column in each of the two tables involved in the join and 
indicates how the two columns should be compared. In most cases, you use the equal 
Operator to retrieve rows with matching columns. However, you can also use any of the 
other comparison operators in a join condition. 


In most cases, you'll join two tables based on the relationship between the primary key 
in one table and a foreign key in the other table. However, you can also join tables based 
on relationships not defined in the database. These are called ad hoc relationships. 


If the two columns in a join condition have the same name, you have to qualify them 
with the table name so Oracle can distinguish between them. To code a qualified column 
name, type the table name, followed by a period, followed by the column name. 


Notes 


The INNER keyword is optional and is seldom used. 

This syntax for coding an inner join can be referred to as the explicit syntax. It is also 
called the SQL-92 syntax because it was introduced by the SQL-92 standards. 

You can also code an inner join using the implicit syntax. See figure 4-7 for more infor- 
mation. 


Figure 4-1 How to code an inner join 
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When and how to use table aliases 


When you name a table to be joined in the FROM clause, you can refer to 
the table by an alias. A table alias is similar to a column alias, except that you 
do not use the word AS, like you do when you assign a column alias. After you 
assign a table alias, you must use the alias in place of the original table name 
throughout the query. This is illustrated in figure 4-2. 

The first SELECT statement in this figure joins data from the Vendors and 
Invoices tables. Here, both tables have been assigned aliases that consist of a 
single letter. Table aliases are used to reduce typing and to make a query more 
understandable, particularly when table names are lengthy. 

The alias used in the second SELECT statement in this figure, for example, 
simplifies the name of the Invoice_Line_Items table to just Line_Items. That 
way, the shorter name can be used to refer to the invoice_id column of the table 
in the join condition. Although this doesn’t improve the query in this example 
much, it can have a dramatic effect on a query that refers to the 
Invoice_Line_Items table several times. 
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The syntax for an inner join that uses table aliases 


SELECT select list 
FROM table 1 nl 
[INNER] JOIN table 2 n2 
ON nl.column_name operator n2.column name 
[ [INNER] JOIN table 3 n3 
ON n2.column name operator n3.column_ name]... 


An inner join with aliases for all tables 


SELECT invoice_number, vendor_name, invoice _due_date, 
(invoice total - payment total - credit_total) AS balance due 
FROM vendors v JOIN invoices i 
ON v.vendor_ id = i.vendor_id 
WHERE (invoice total - payment_total - credit total) > 0 
ORDER BY invoice due date DESC 


The result set 


INYOICE_NUMBER YENDOR_NA&AME INYOICE_DUE_DATE BALANCE_DUE 
1 40318 Data Reproductions Corp 20-JUL-08 21642 


2 39104 Data Reproductions Corp 20-JUL-08 66.31 
3 0-2436 Malloy Lthographing Ine 47-JUL-08 10976.06 


(40 rows selected) 


An inner join with an alias for only one table 


SELECT invoice_number, line _ item amt, line_item description 
FROM invoices JOIN invoice line items line items 
ON invoices.invoice_id = line_items.invoice_id 
WHERE account number = 540 
ORDER BY invoice date 


The result set 


p INYOICE_NUMBER g LINE_|TEM_&MT p LINE_ITEM_DESCRIPTION 
1 97/5538 313.55 Card revision 


2 97/553 904.14 DB2 Card dacks 
3 97/522 765.13 SCMD Flyer 


(8 rows selected) 


Description 


A table alias is an alternative table name assigned in the FROM clause. You can use an 
alias when a long table name would make qualified column names long or confusing. 


If you assign an alias to a table, you must use that alias to refer to the table throughout 
your query. You can’t use the original table name. 


You can use an alias for any table in a join without using an alias for any other table. 


Use table aliases whenever they simplify or clarify the query. Avoid using them when 
they make a query more confusing or difficult to read. 


Figure 4-2 When and how to use table aliases 
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How to work with tables from different schemas 


If you use the procedure in figure A-5 of appendix A to create the tables for 
this book, all of the tables will be organized into three schemas. First, all tables 
pertaining to accounts payable such as the Vendors and Invoices tables are 
stored in the schema named AP. Then, all tables pertaining to order management 
are stored in a schema named OM. Finally, all tables that are used by the 
smaller examples presented in this book are stored in a schema named EX. 

In addition, the procedure in figure A-5 creates one user that is the owner 
for each schema. In particular, a user named AP is the owner of the AP schema, 
a user named OM is the owner of the OM schema, and a user named EX is the 
owner of the EM schema. 

When you connect to Oracle as a user that’s the owner of a schema, you do 
not need to qualify any table name with the schema name. This is the case most 
of the time. If, for example, you connect as the AP user, you can access any of 
the tables in the AP schema. However, you may occasionally need to join to a 
table that’s in another schema. To do that, you must qualify the table name in 
the other schema by prefixing the table name with the schema name. This is 
shown in figure 4-3. 

Before you can work with tables in another schema, you must log on as a 
user that has appropriate permissions to work with the tables in that schema. 
The first example in this figure shows a statement that can be used to grant 
permissions to the AP user that allows that user to select data from the Custom- 
ers table in the OM schema. To run this statement, you must log on as the OM 
user, which has the appropriate permissions to grant this permission to the AP 
user. 

For now, that’s all you need to know about granting permissions. However, 
you'll learn more about users and permissions in chapter 12. Usually, though, 
you won’t need to worry about this since your database administrator (DBA) 
will be responsible for granting the appropriate permissions to each user. 

Once you have granted the appropriate permissions to the AP user, you can 
log on as the AP user and run the SELECT statement shown in the second 
example. This statement joins data from two tables (Vendors and Customers) in 
two different schemas (AP and OM). To do that, this statement qualifies the 
Customers table with the name of the OM schema. Since this example assumes 
that you’re logged on as the AP user, it isn’t necessary to qualify the Vendors 
table with the name of the AP schema. However, if you thought it improved the 
readability of the statement, you could qualify the Vendors table too, and the 
result of the query would be the same. 
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The syntax of a table name that’s qualified with a schema name 


schema _name.table name 


A SQL statement that grants the SELECT permission on the Customers 
table in the OM schema to the AP schema 
GRANT SELECT ON customers TO ap 


A SQL statement that joins to a table from another schema 


SELECT vendor name, customer_last_name, customer first name, 
vendor state AS state, vendor city AS city 

FROM vendors v 
JOIN om.customers c 
ON v.vendor zip code = c.customer zip 

ORDER BY state, city 


The result set 


VENDOR_NAME p CUSTOMER_LAST_NAME CUSTOMER_FIRST_NAME |STATE CITY 
1 Wells Fargo Bank Marissa Kyle Phoenix 
2 Aztek Label Irvin Ania Anaheim 
3 Lou Gertile's Flower Basket Damien Deborah Fresno 
4 Shields Design Damian Deborah Fresno 


5 Costco Nattaly Thalia Frasno 
6 Costco Holkrooke Rashad Fresno 
7 Gary McKalghan Insurance Holbrooke Rashad Frasno 
6 Zylka Design Nattaly Thalia Frasno 
3 Zylka Design Holbrooke Rashad Frasno 


(37 rows) 


Description 


e Before you can access a table in another schema, you must log on as a user that has 
appropriate access permissions. 


e To grant access permissions to a user, you can use a GRANT statement. For more 
information about using the GRANT statement, see chapter 12. 


e To join to a table in another schema, you must prefix the table name with the schema 
name. 


Figure 4-3 How to work with tables from different schemas 
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How to use compound join conditions 


Although a join condition typically consists of a single comparison, you can 
include two or more comparisons in a join condition using the AND and OR 
operators. Figure 4-4 illustrates how this works. 

In the first SELECT statement in this figure, you can see that the Invoices 
and Invoice_Line_Items tables are joined based on two comparisons. First, the 
primary key of the Invoices table, invoice_id, is compared with the foreign key 
of the Invoice_Line_Items table, also named invoice_id. As in previous ex- 
amples, this comparison uses an equal condition. Then, the invoice_total 
column in the Invoices table is tested for a value greater than the value of the 
invoice_line_item_amt column in the Invoice_Line_Items table. That means 
that only those invoices that have two or more line items will be included in the 
result set. You can see part of this result set in this figure. 

Another way to code these conditions is to code the primary join condition 
in the FROM clause and the other condition in the WHERE clause. This is 
illustrated by the second SELECT statement in this figure. 

When you code separate compound join conditions like this, you may 
wonder which technique is most efficient. However, Oracle typically does a 
good job of optimizing queries so they run efficiently regardless of how you 
write the query. As a result, you can usually focus on writing a query so the 
code is easy to read and maintain. Then, if you encounter performance problems 
with your queries, you can work on optimizing the query later. 
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An inner join with two conditions 


SELECT invoice number, invoice date, 
invoice_total, line_item_amt 
FROM invoices i JOIN invoice line items 1i 
ON (i.invoice_id = li.invoice_id) AND 
(1.invoice total > 1i.line item_amt) 
ORDER BY invoice number 


The same join with the second condition coded in a WHERE clause 


SELECT invoice_number, invoice date, 
invoice total, line_item_amt 

FROM invoices i JOIN invoice_line items li 
ON i.invoice id = li.invoice id 

WHERE i.invoice total > li.line item_amt 


ORDER BY invoice number 
The result set 


D invoice_numBer A invoice_paTe A invoice_toTaL | LINE JTEM_AMT 
1 97/522 30-APR-08 4962.13 


2 97/522 30-APR-08 1962.13 
3 177271-001 O5-JUN-08 662 
4 77271-001 05-JUN-08 662 


(6 rows selected) 


Description 
e A join condition can include two or more conditions connected by AND or OR opera- 
tors. 


e In most cases, your code will be easier to read if you code the join condition in the ON 
expression and search conditions in the WHERE clause. 


Figure 4-4 How to use compound join conditions 
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How to use a self-join 


A self-join is a join where a table is joined with itself. Although self-joins 
are rare, there are some unique queries that are best solved using self-joins. 

Figure 4-5 presents an example of a self-join that uses the Vendors table. 
Notice that since the same table is used twice, aliases are used to distinguish 
one occurrence of the table from the other. In addition, each column name used 
in the query is qualified by the alias since the columns occur in both tables. 

The join condition in this example uses three comparisons. The first two 
match the vendor_city and vendor_state columns in the two tables. As a result, 
the query will return rows for vendors that reside in the same city and state as 
another vendor. Because a vendor resides in the same city and state as itself, 
however, a third comparison is included to exclude rows that match a vendor 
with itself. To do that, this condition uses the not equal operator to compare the 
vendor_id columns in the two tables. 

Notice that the DISTINCT keyword is also included in this SELECT 
statement. That way, a vendor appears only once in the result set. Otherwise, it 
would appear once for each row with a matching city and state. 

This example also shows how you can use columns other than key columns 
in a join condition. Keep in mind, however, that this is an unusual situation and 
you’re not likely to code joins like this often. 
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A self-join that returns vendors from cities in common with other vendors 


SELECT DISTINCT vl.vendor_name, vi.vendor city, 
vi.vendor_state 
FROM vendors v1 JOIN vendors v2 
ON (vl.vendor_city = v2.vendor_city) AND 
(vl.vendor_ state = v2.vendor state) AND 
(v1l.vendor_id <> v2.vendor_id) 
ORDER BY vi.vendor_state, vl.vendor city 


The result set 


By  YENDOR_NAME f YENDOR_CITY |VENDOR_STATE 
1 AT&T Phoenix 
2 Computer Library Phoenix 
3 Wells Fargo Bank Phoenix 
4 Aztek Label Anaheim 
§ Blue Shield of California Anaheim 
6 ASC Signs Fresno 


7 Abbey Office Furnishings Fresno 
8 EFI Industries Fresno 


(84 rows selected) 


Description 
e A self-join is a join that joins a table with itself. 


e When you code a self-join, you must use aliases for the tables, and you must qualify 
each column name with the alias. 


Figure 4-5 How to use a self-join 
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So far in this chapter, you’ve seen how to join data from two tables. De- 
pending on the requirement, however, you may have to join many more tables. 
For example, you sometimes need to join 10 or more tables. 

The SELECT statement in figure 4-6 joins data from four tables: Vendors, 
Invoices, Invoice_Line_Items, and General_Ledger_Accounts. Each of the joins 
is based on the relationship between the primary key of one table and a foreign 
key of the other table. For example, the account_number column is the primary 
key of the General_Ledger_Accounts table and a foreign key of the 
Invoice_Line_Items table. 

This SELECT statement also begins to show how table aliases make a 
statement easier to code and read. Here, the one-letter and two-letter aliases that 
are used by the tables allow you to code the ON clause more concisely. 
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A SELECT statement that joins four tables 


SELECT vendor name, invoice number, invoice date, 
line_item_amt, account_description 
FROM vendors v 
JOIN invoices i 
ON v.vendor id = i.vendor id 
JOIN invoice_line items li 
ON i.invoice id = li.invoice id 
JOIN general ledger_accounts gl 
ON li.account_number = gl.account number 
WHERE (invoice total - payment_total - credit total) > 0 
ORDER BY vendor name, line item_amt DESC 


The result set 


VENDOR_NAME @ invoice_NuMBeR |) INvoIcE_DATE LINE_ITEM_AMT |] ACCOUNT_DESCRIPTION 
1 Abbey Office Furnishings 203339-13 02-MAY-08 17.5 Office Supplies 
2 Blue Cross 547481328 20-MAY-08 224 Group Insurance 
3 Blue Cross 547460102 19-MAY-~08 224 Group Insurance 
4 Blue Cross 547479217 17-MAY-08 116 Group Insurance 


5 Cardinal Business Media, Inc. 134116 O41 -JUN-08 90.36 Card Dack Advertising 

6 Coffee Break Service 109596 14-JUN-08 41.8 Meals 

7 Compuserve 21-47480363 O9-MAY-08 8.95 Books, Duss, and Subscriptions 
8 Computerworld 367447 31-MAY-08 2433 Card Deck Advertising 


(44 rows selected) 


Description 


e You can think of a multi-table join as a series of two-table joins proceeding from left to 
right. 


Figure 4-6 Inner joins that join more than two tables 
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How to use the implicit inner join syntax 


Earlier in this chapter, we mentioned that Oracle provides an implicit syntax 
for joining tables. This syntax was used prior to Oracle 9i. Although we recom- 
mend that you use the explicit syntax when you’re developing SQL statements 
for newer versions of Oracle, you should be familiar with the implicit syntax in 
case you ever need to maintain existing SQL statements that use it. 

Figure 4-7 presents the implicit syntax for an inner join, along with two 
statements that use it. As you can see, the tables to be joined are simply listed in 
the FROM clause. Then, the join conditions are included in the WHERE clause. 

The first SELECT statement joins data from the Vendors and Invoices 
tables. Like the SELECT statement you saw back in figure 4-1, these tables are 
joined based on an equal comparison between the vendor_id columns in the two 
tables. In this case, though, the comparison is coded as the search condition of 
the WHERE clause. If you compare the result set shown in this figure with the 
one in figure 4-1, you’ll see that they’re identical. 

The second SELECT statement uses the implicit syntax to join data from 
four tables. This is the same join you saw in figure 4-6. Notice in this example 
that the three join conditions are combined in the WHERE clause using the 
AND operator. In addition, an AND operator is used to combine the join condi- 
tions with the search condition. 

Because the explicit syntax for joins lets you separate join conditions from 
search conditions, statements that use the explicit syntax are typically easier to 
read than those that use the implicit syntax. In addition, the explicit syntax helps 
you avoid a common coding mistake with the implicit syntax: omitting the join 
condition. As you’ll learn later in this chapter, an implicit join without a join 
condition results in a cross join, which can return a large number of rows. For 
these reasons, we recommend that you use the explicit syntax in all your new 
SQL code. 
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The implicit syntax for an inner join 


SELECT select list 
FROM table_1, table 2 [, table 3]... 
WHERE table 1.column name operator table 2.column name 
[AND table 2.column_name operator table_3.column name]... 


A SELECT statement that joins the Vendors and Invoices tables 


SELECT invoice_number, vendor_name 
FROM vendors v, invoices i 

WHERE v.vendor_id = i.vendor_id 
ORDER BY invoice number 


The result set 
@ Invoice_NUMBER | VENDOR_NAME 
1 0-2058 Malloy Lithographing Inc 
2 0-2060 Malloy Lithographing Inc 


3 0-2436 Malloy Lithographing Inc 
4 1-200-5164 Federal Express Corporation 
5 1-202-2978 Federal Express Corporation 


(114 rows selected) 


A statement that joins four tables 


SELECT vendor_name, invoice_number, invoice date, 
line_item_amt, account description 
FROM vendors v, invoices i, invoice_line_ items li, 
general ledger_accounts gl 
WHERE v.vendor_id = i.vendor_id 
AND i.invoice id = li.invoice id 
AND li.account_number = gl.account_number 
AND (invoice total - payment total - credit total) > 0 
ORDER BY vendor_name, line item amt DESC 


The result set 
p YENDOR_NAME INYOICE_NUMBER p INYOICE_DATE LINE_ITEM_AMT p ACCOUNT_DESCRIPTION 
1 Abbey Office Furnishings 203339-13 02-MAY-08 17.5 Office Supplies 
2 Blue Cross 547460102 19-MAY -08 224 Group Insurance 


3 Blue Cross 547461326 20-MA‘Y-08 224 Group Insurance 
4 Blua Cross 547479217 1417-MAY-06 116 Group Insurance 
5 Cardinal Business Madla, Inc. 134116 04 -JUN-08 90,36 Card Dack Advertising 


(44 rows selected) 


Description 


e Instead of coding a join condition in the FROM clause, you can code it in the WHERE 
clause along with any search conditions. Then, you simply list the tables you want to join 
in the FROM clause separated by commas. 

e This syntax for coding joins is referred to as the implicit syntax, or the theta syntax. It 
was used prior to the SQL-92 standards, which introduced the explicit syntax. 

e If you omit the join condition from the WHERE clause, a cross join is performed, You’ll 
learn about cross joins later in this chapter. 


Figure 4-7 How to use the implicit inner join syntax 
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How to work with outer joins 


Although inner joins are the type of join you’ll use most often, Oracle also 
supports outer joins. Unlike an inner join, an outer join returns all of the rows 
from one or both tables involved in the join, regardless of whether the join 
condition is true. You’ll see how this works in the topics that follow. 


How to code an outer join 


Figure 4-8 presents the explicit syntax for coding an outer join. Because this 
syntax is similar to the explicit syntax for inner joins, you shouldn’t have any 
trouble understanding how it works. The main difference is that you include the 
LEFT, RIGHT, or FULL keyword to specify the type of outer join you want to 
perform. As you can see in the syntax, you can also include the OUTER key- 
word, but it’s optional and is usually omitted. 

The table in this figure summarizes the differences between left, right, and 
full outer joins. When you use a left outer join, the result set includes all the 
rows from the first, or left, table. Similarly, when you use a right outer join, the 
result set includes all the rows from the second, or right, table. And when you 
use a full outer join, the result set includes all the rows from both tables. 

The example in this figure illustrates a left outer join. Here, the Vendors 
table is joined with the Invoices table. Notice that the result set includes vendor 
rows even if no matching invoices are found. In that case, null values are 
returned for the columns in the Invoices table. 

When coding outer joins, it’s a common practice to avoid using right joins. 
To do that, you can substitute a left outer join for a right outer join by reversing 
the order of the tables in the FROM clause and using the LEFT keyword instead 
of RIGHT. This often makes it easier to read statements that join more than two 
tables. 
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The explicit syntax for an outer join 


SELECT select list 
FROM table 1 
{LEFT|RIGHT|FULL} [OUTER] JOIN table 2 
ON join _condition_1 
[{LEFT|RIGHT|FULL} [OUTER] JOIN table 3 
ON join _condition_2]... 


What outer joins do 
Joins of this type Keep unmatched rows from 
Left outer join The first (left) table 


Right outer join The second (right) table 
Full outer join Both tables 


A SELECT statement that uses a left outer join 


SELECT vendor_name, invoice_number, invoice_total 
FROM vendors LEFT JOIN invoices 

ON vendors.vendor_id = invoices.vendor_id 
ORDER BY vendor name 


The result set 


a YENDOR NAME J INVOICE_NUMBER | INYOICE_TOTAL 
1 ASC Signs truli (null) 
2 AT&T Cull) (null) 
3 Abbey Office Furnishings 203339-13 17.5 
4 American Booksellers Assoc (null) (null) 


5 American Express {null (null) 


(202 rows selected) 


Description 
e An outer join retrieves all rows that satisfy the join condition, plus unmatched rows in 
one or both tables. 


e In most cases, you use the equal operator to retrieve rows with matching columns. 
However, you can also use any of the other comparison operators. 


e When a row with unmatched columns is retrieved, any columns from the other table that 
are included in the result set are given null values. 


Notes 
e The OUTER keyword is optional and typically omitted. 


e You can also code left outer joins and right outer joins using the implicit syntax. See 
figure 4-11 for more information. 


Figure 4-8 How to code an outer join 
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Outer join examples 


To give you a better understanding of how outer joins work, figure 4-9 
presents three more examples. These examples use the Departments and Em- 
ployees tables shown at the top of this figure. In each case, the join condition 
joins the tables based on the values in their department_number columns. 

The first SELECT statement performs a left outer join on these two tables. 
In the result set produced by this statement, you can see that department number 
3 (Operations) is included in the result set even though none of the employees in 
the Employees table work in that department. Because of that, a null value is 
assigned to the last_name column from that table. 

The second SELECT statement uses a right outer join. In this case, all of the 
rows from the Employees table are included in the result set. Notice, however, 
that two of the employees, Locario and Watson, are assigned to a department 
that doesn’t exist in the Departments table. Of course, if the department_number 
column in this table had been defined as a foreign key to the Departments table, 
this would not have been allowed. In this case, though, a foreign key wasn’t 
defined, so null values are returned for the department_name column in these 
two rows. 

The third SELECT statement in this figure illustrates a full outer join. If you 
compare the results of this query with the results of the queries that use a left 
and right outer join, you’ll see that this is a combination of the two joins. In 
other words, each row in the Departments table is included in the result set, 
along with each row in the Employees table. Because the department_number 
column from both tables is included in this example, you can clearly identify 
the row in the Departments table that doesn’t have a matching row in the 
Employees table and the two rows in the Employees table that don’t have 
matching rows in the Departments table. 


Chapter 4 


The Departments table 


F] DEPARTMENT_NUMBER F DEPARTMENT_NAME 
1 Accounting 
2 Payroll 


3 Operations 
4 Personnel 


§ Maintenance 


A left outer join 


SELECT department_name AS dept_name, 
d.department number AS dept_no, 
last_name 

FROM departments d 
LEFT JOIN employees e 
ON d.department number = 

e.department_number 

ORDER BY department _name 


A right outer join 


SELECT department name AS dept name, 
e.department number AS dept_no, 
last_name 

FROM departments d 
RIGHT JOIN employees e 
ON d.department number = 

e.department number 

ORDER BY department name 


A full outer join 


SELECT department_name 
AS dept_name, 
d.department_number 
AS d_dept_no, 
e.department_number 
AS e_dept_no, 
last_name 
FROM departments d 
FULL JOIN employees e 
ON d.department_number = 
e.department_number 
ORDER BY department_name 


How to retrieve data from two or more tables 


1 Cindy 
2 Elmer 

3 Ralph 

4 Ollvla 

§ Robert 
6 Denise 
7 Thomas 
8 Rhea 

9 Paulo 


DEPT_NAME 
1 Accounting 
2 Malntenance 
3 Operations 
4 Payroll 


5 Payroll 
6 Payroll 
7 Personnel 


8 Personnel 
9 (nul) 
10 (null) 


The Employees table 


@ empLoveeo |) First_ame|f] LasT_Name |f] DEPARTMENT NUMBER 


Smith 
Jones 
Simonian 
Hernandez 
Aaronsen 
Watson 
Hardy 
O'Leary 
Locarlo 


no fra oe > uu b LUD 


@ pept_Name |) pepT_No |} LAST_NAME 


1 Accounting 
2 Maintenance 
3 Operations 

4 Payroll 

§ Payroll 

6 Payroll 

7 Personnel 

8 Personnel 


1 Hernandez 
5 Hardy 

3 (null) 

2 Smith 

2 Simonian 

2 Aaronsen 
4 Jones 

4 O'Leary 


A bePT_NAME|[J DEPT_NO |] LAST_NAME 


1 Accounting 
2 Maintenance 
3 Payroll 

4 Payroll 

5 Payroll 

6 Personnel 

7 Personnel 

8 Crull) 

9 inul) 


1 Hernandez 
5 Hardy 

2 Aaronsen 
2 Simonian 

2 Smith 

4 Jones 

4 O'Leary 

6 Locario 

B Watson 


I D DEPT_NO |} E_pEPT_No |) LAST_NAME 


e BN KH KH WD = 


(null) 
(null) 


1 Hernandez 
5 Hardy 


(ruil) (null) 


2 Simonian 
2 Smith 

2 Aaronsen 
4 O'Leary 

4 Jones 

6 Watson 

6 Locario 


Figure 4-9 Outer join examples 
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Like inner joins, you can use outer joins to join data from more than two 
tables. The two examples in figure 4-10 illustrate how this works. These ex- 
amples use the Departments and Employees tables you saw in the previous 
figure, along with a Projects table. All three tables are shown at the top of this 
figure. 

The first example in this figure uses left outer joins to join the data in the 
three tables. Here, you can see once again that none of the employees in the 
Employees table are assigned to the Operations department. Because of that, 
null values are returned for the columns in both the Employees and Projects 
tables. In addition, you can see that two employees, Hardy and Jones, aren’t 
assigned to a project. 

The second example in this figure uses full outer joins to join the three 
tables. This result set includes unmatched rows from the Departments and 
Employees table, just like the result set you saw in figure 4-9 that was created 
using a full outer join. In addition, the result set in this example includes an 
unmatched row from the Projects table: the one for project number P1014. In 
other words, none of the employees are assigned to this project. 


Chapter 4 


The Departments table 


F] DEPARTMENT_NUMBER F DEPARTMENT_NAME 
1 Accounting 


2 Payrall 


3 Operations 
4 Personnel 
5 Maintenance 


The Projects table 
Ü| PROJECT_NUMBER EMPLOYEE JD | 


The Employees table 


EMPLOYEE JD | FIRST_NaME |) LAST_Name | DEPARTMENT NUMBER 
1 Cindy Smith 2 
2 Elmer Jones 4 
3 Ralph Simonian 2 
4 Olivia Hernandez 1 
5 Robert Aaronsen 2 
6 Denise Watson 5 
7 Thomas Hardy 5 
8 Rhea O'Leary 4 
9 Paulo Locario 6 


A SELECT statement that joins the three tables using left outer joins 


SELECT department_name, 
last_name, 
project_number AS proj_no 
FROM departments d 
LEFT JOIN employees e 
ON d.department_ number = 
e.department_number 
LEFT JOIN projects p 
ON e.employee id = 
p-employee id 
ORDER BY department name, last name, 
project number 


(J DEPARTMENT NAME |] LAST_NAME |B} PROJ_NO 


1 Accounting Hernandez P1011 
2 Maintenance Hardy (null) 
3 Operations (null) null) 
4 Payroll Aaronsan P1012 
5 Payroll Simonlan P1012 
& Payroll Smith P1012 
7 Personnel Jones (null) 
6 Personnel O'Leary P1011 


A SELECT statement that joins the three tables using full outer joins 


SELECT department_name, last_name, 
project_number AS proj_no 
FROM departments dpt 
FULL JOIN employees emp 
ON dpt.department_number = 
emp.department_number 
FULL JOIN projects prj 
ON emp.employee id = 
prj.employee id 
ORDER BY department_name 


Figure 4-10 


È DEPARTMENT_NAME |] Last Name |E PROJ_NO 


1 Accounting Hernandez P1014 
2 Maintenance Hardy (null) 
3 Operations (null) (null) 
4 Payroll Simonian P1012 
5 Payroll Smith P1012 
6 Payroll Aaronsan P1012 
7 Personnel Jones rull) 
6 Personnel O'Leary P1011 
9 (null) Locario P1013 
10 (ruil) (null) P1014 
44 grull) Watson P1013 


Outer joins that join more than two tables 


How to retrieve data from two or more tables 
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How to use the implicit outer join syntax 


When you’re writing new queries that use outer joins, you may use either 
the explicit outer join syntax described in figure 4-8 or the implicit outer join 
syntax shown in figure 4-11. However, to make your code easier to read and 
maintain, you should try to be consistent with the syntax that’s used elsewhere 
in the application you are working on. 

Note, however, that you can’t perform a full outer join using the implicit 
syntax. As a result, if you need to perform a full outer join, you’ll have to use 
the explicit outer join syntax. 


Chapter4 How to retrieve data from two or more tables 


The implicit syntax for an outer join 


SELECT select list 

FROM table 1, table 2 [, table 3]... 

WHERE table _1.column_ name [(+)] table_2.column name [(+)] 
[table_2.column_name [(+)] table_3.column_name [(+)]]... 


A SELECT statement that joins two tables using a left outer join 


SELECT department_name AS dept_name, 
dpt.department number AS dept_no, 


last_name 


FROM departments dpt, employees emp 
WHERE dpt.department_number = emp.department_number (+) 
ORDER BY department _name 


The result set 


DEPT_NAME |) DEPT_No | LAST_NAME 


1 Accounting 


2 Maintenance 
3 Operations 
4 Payroll 

5 Payroll 

6 Payroll 

7 Personnel 

6 Personnel 


(8 rows selected) 


1 Hernandez 
5 Hardy 

3 Cull) 

2 Slmonian 

2 Aaronsen 
2 Smith 

4 Jones 

4 O'Leary 


A SELECT statement that joins two tables using a right outer join 


SELECT department_name AS dept_name, 
emp.department number AS dept_no, 


last_name 


FROM departments dpt, employees emp 
WHERE dpt.department_number (+) = emp.department_number 
ORDER BY department _name 


The result set 


DEPT_NAME |A DEPT_No |B) LAST_NAME 


1 Accounting 
2 Maintenance 
3 Payroll 

4 Payroll 


5 Payroll 

6 Personnel 
? Personnel 
8 (null) 

a cnul) 


(9 rows selected) 


Description 


1 Hernandez 
5 Hardy 

2 Aaronsen 
2 Simonian 

2 Smith 

4 Jones 

4 O'Leary 

6 Locarla 

6 Watson 


e The implicit syntax for outer joins is an alternative to the SQL-92 standards. 


Figure 4-11 


How to use the implicit outer join syntax 
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Other skills for working with joins 


The topics that follow present other skills for working with joins. In the first 
topic, you’ll learn how to use inner and outer joins in the same statement. Next, 
you'll learn how to join tables with the USING and NATURAL keywords. 
Then, you’ll learn how to use another type of join, called a cross join. 


How to combine inner and outer joins 


Figure 4-12 shows how you can combine inner and outer joins. In this 
example, the Departments table is joined with the Employees table using an 
inner join and the Employees table is joined to the Projects table with a left 
outer join. The result is a table that includes all of the departments that have 
employees assigned to them, all of the employees assigned to those depart- 
ments, and the projects those employees are assigned to. Here, you can clearly 
see that two employees, Hardy and Jones, haven’t been assigned projects. 
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The Departments table The Employees table 
J DEPARTMENT NUMBER |] DEPARTMENT NAME F eMpLoyvee_o |) FIRST_NAME|[) LasT_NAME |A DEPARTMENT _NUMBER 
1 Accounting 1 Cindy Smith 
2 Payroll 2 Elmer Jones 
3 Operations 3 Ralph Simonian 
4 Personnel 4 Olivia Hernandez 
5 Maintenance 5 Robert Aaronsen 
6 Denise Watson 
: 7 Thomas Hardy 
The Projects table B Rhea O'Leary 


9 Paulo Locario 


FROJECT_NUMBER EMPLOYEE JD 


A SELECT statement that combines an outer and an inner join 


SELECT department_name AS dept_name, last_name, project_number 
FROM departments dpt 
JOIN employees emp 
ON dpt.department_number = eəemp.department_number 
LEFT JOIN projects prj 
ON emp.employee id = prj.employee id 
ORDER BY department_name 


The result set 


DEPT_NAME |] LAST_NAME |) PROJECT_NUMBER 
1 Accounting Hernandez P1011 
2 Maintenance Hardy Crull) 
3 Payroll Simonian P1012 


4 Payroll Smith P1012 
5 Payroll Aaronsen P1012 
6 Personnel Jones grull} 
7 Personnel O'Leary PF41011 


(7 rows selected) 


Description 


e You can combine inner and outer joins within a single SELECT statement using either 
the explicit or the implicit join syntax. 


Figure 4-12 How to combine inner and outer joins 
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How to join tables with the USING keyword 


When you use the equals operator to join two tables on a common column, 
the join can be referred to as an equijoin (or an equi-join). When you code an 
equijoin, it’s common for the columns that are being compared to have the same 
name. For joins like these, you can simplify the query with the USING keyword 
that was introduced with Oracle 9i. In other words, you can code a USING 
clause instead of an ON clause to specify the join as shown in figure 4-13. 

Here, the first example shows how to join the Vendors and Invoices tables 
on the vendor_id column with a USING clause. This returns the same results as 
the query shown in figure 4-1 that uses the ON clause. Note that the USING 
clause only works because the vendor_id column exists in both the Vendors and 
Invoices tables. 

The second example shows how to join the Departments, Employees, and 
Projects tables with the USING keyword. Here, the first USING clause uses an 
inner join to join the Departments table to the Employees table on the 
department_number column. Then, the second USING clause uses a left join to 
join the Employees table to the Projects table on the employee_id column. This 
shows that you can use a USING clause for inner and outer joins, and it returns 
the same result as the query shown in figure 4-12. 

In some rare cases, you may want to join a table by multiple columns. To do 
that with a USING clause, you can code multiple column names within the 
parentheses, separating each column name with a comma. This yields the same 
result as coding two equijoins connected with the AND operator. 

Since the USING clause is more concise than the ON clause, it can make 
your code easier to read and maintain. As a result, it often makes sense to use 
the USING clause when you’re developing new statements. However, if you 
can’t get the USING clause to work correctly for a complex query, you can 
always use an ON clause instead. 


Chapter4 How to retrieve data from two or more tables 


The syntax for a join that uses the USING keyword 


SELECT select list 
FROM table _1 
[{LEFT|RIGHT|FULL} [OUTER]] JOIN table 2 
USING (join_column_1[, join_column_2]...) 
[ [{LEFT | RIGHT |FULL} [OUTER]] JOIN table 3 
USING (join_column_2[, join_column_2]...)]... 


A SELECT statement that uses the USING keyword to join two tables 


SELECT invoice_number, vendor_name 
FROM vendors 

JOIN invoices USING (vendor_id) 
ORDER BY invoice_number 


The result set 
INYOICE_NUMBER VENDOR_NAME 
1 0-2058 Malloy Lithagraphing Inc 
2 0-2060 Malloy Lithagraphing Inc 


3 0-2436 Malloy Lithographing Inc 
4 1-200-5164 Federal Express Corporation 


(114 rows selected) 


A SELECT statement that uses the USING keyword to join three tables 


SELECT department name AS dept_name, last_name, project_number 
FROM departments 

JOIN employees USING (department_number) 

LEFT JOIN projects USING (employee _ id) 
ORDER BY department _name 


The result set 


B DEPT_NAME |i LAST_NAME |i PROJECT_NUMBER 
1 Accounting Hernandez P1014 
2 Maintenance Hardy (null) 
3 Payroll Slmonlan P1012 
4 Payroll Smith P4012 


5 Payroll Aaronsen P1012 


6 Personnel Jones grull} 
7 Personnel O'Leary P1011 


(7 rows selected) 


Description 


You can use the USING keyword to simplify the syntax for joining tables. 

The join can be an inner join or an outer join. 

The tables must be joined by a column that has the same name in both tables. 

If you want to include multiple columns, you can separate each table with a comma. 


The join must be an equijoin, which means that the equals operator is used to compare 
the two columns. 


Figure 4-13 How to join tables with the USING keyword 
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How to join tables with the NATURAL keyword 


Figure 4-14 shows how to use NATURAL keyword that was introduced 
with Oracle 9i to code a natural join. When you code a natural join, you don’t 
specify the column that’s used to join the two tables. Instead, the database 
automatically joins the two tables based on all columns in the two tables that 
have the same name. As a result, this type of join only works correctly if the 
database is designed in a certain way. 

For instance, if you use a natural join to join the Vendors and Invoices tables 
as shown in the first example, the join works correctly because these tables only 
have one column in common: the vendor_id column. As a result, the database 
joins these two tables on the vendor_id column. However, if these tables had 
another column in common, this query would attempt to join these tables on 
both columns and would yield unexpected results. 

Although natural joins are easy to code, these joins don’t explicitly specify 
the join column. As a result, they might not work correctly if the structure of the 
database changes later. Because of that, you’ll usually want to avoid using 
natural joins for production code. 

In addition, if you use natural joins, you may get unexpected results for 
more complex queries. In that case, you can use the USING or ON clauses to 
explicitly specify the join since these clauses give you more control over the 
join. If necessary, you can mix a natural join with the USING or ON clauses 
within a single SELECT statement. In this figure, for example, the second 
SELECT statement uses a natural join for the first join and uses a USING clause 
for the second join. The result is the same as the result for the second statement 
in figure 4-13. 
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The syntax for a join that uses the NATURAL keyword 


SELECT select list 

FROM table 1 
NATURAL JOIN table 2 
[NATURAL JOIN table 3]... 


A SELECT statement that uses the NATURAL keyword to join tables 


SELECT invoice_number, vendor_name 
FROM vendors 

NATURAL JOIN invoices 
ORDER BY invoice number 


The result set 


INVOICE_NUMBER |] VENDOR_NAME 
4 0-2058 Malloy Lihographing Inc 


2 0-2060 Malloy Lithagraphing Inc 
3 0-2436 Malloy Lithographing Inc 
4 141-200-5164 Federal Express Corporation 


(114 rows selected) 


A SELECT statement that uses the NATURAL keyword to join three tables 


SELECT department name AS dept_name, last_name, project_number 
FROM departments 

NATURAL JOIN employees 

LEFT JOIN projects USING (employee _ id) 
ORDER BY department _name 


The result set 


a LAST_NAME PROJECT_NUMBER 
1 Accounting Hernandez P1011 
2 Malrtanance Hardy (null) 
3 Payroll Slmonian P1012 


4 Payroll Smith P1012 
5 Payroll åaronsen P1012 
6 Personnel Jones Cull) 
7 Personnel O'Leary P1014 


(7 rows selected) 


Description 


You can use the NATURAL keyword to create a natural join that simplifies the syntax 
for joining tables. A natural join can be used to join two tables based on all columns in 
the two tables that have the same name. 


Although the code for a natural join is shorter than the code for joins that use the ON or 
USING clauses, a natural join only works correctly for certain types of database struc- 
tures. In addition, a natural join often yields unexpected results for complex queries. As a 
result, it’s more common to use the ON or USING clauses to join tables. 


Figure 4-14 How to join tables with the NATURAL keyword 
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How to use cross joins 


A cross join produces a result set that includes each row from the first table 
joined with each row from the second table. The result set is known as the 
Cartesian product of the tables. Figure 4-15 shows how to code a cross join 
using either the explicit or implicit syntax. 

To use the explicit syntax, you include the CROSS JOIN keywords between 
the two tables in the FROM clause. Notice that because of the way a cross join 
works, you don’t include a join condition. The same is true when you use the 
implicit syntax. In that case, you simply list the tables in the FROM clause and 
omit the join condition from the WHERE clause. 

The two SELECT statements in this figure illustrate how cross joins work. 
Both of these statements combine data from the Departments and Employees 
tables. As you can see, the result is a table that includes 45 rows. That’s each of 
the five rows in the Departments table combined with each of the nine rows in 
the Employees table. Although this result set is relatively small, you can imag- 
ine how large it would be if the tables included hundreds or thousands of rows. 

As you study these examples, you should realize that cross joins have few 
practical uses. As a result, you’ll rarely, if ever, need to use one. In fact, you’re 
most likely to code a cross join by accident if you use the implicit join syntax 
and forget to code the join condition in the WHERE clause. That’s one of the 
reasons why it’s generally considered a good practice to use the explicit join 
syntax. 
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How to code a cross join using the explicit syntax 


The explicit syntax for a cross join 


SELECT select_list 
FROM table_1 CROSS JOIN table 2 


A cross join that uses the explicit syntax 

SELECT departments.department_number, department_name, employee_id, 
last_name 

FROM departments CROSS JOIN employees 

ORDER BY departments.department number 


How to code a cross join using the implicit syntax 


The implicit syntax for a cross join 
SELECT select_list 

FROM table_1, table 2 

A cross join that uses the implicit syntax 


SELECT departments.department_number, department name, employee id, 
last_name 

FROM departments, employees 

ORDER BY departments .department_number 


The result set 


DEPARTMENT _NUMBER Fl DEPARTMENT_NAME |È) EmpLovee o |i] LAST_NAME 


1 1 Accounting 4 Hernandez * 
2 1 Accounting 3 Simonian 

3 1 Accounting 9 Locari 

4 1 Accounting 8 O'Leary 

5 1 Accounting 7 Hardy 

B 4 Accounting 6 Watson 

T 1 Accounting 5 Aaronsen | 


{45 rows selected) 


Description 


e Across join joins each row from the first table with each row from the second table. The 
result set returned by a cross join is known as a Cartesian product. 


e To code a cross join using the explicit syntax, use the CROSS JOIN keywords in the 
FROM clause. 


e To code a cross join using the implicit syntax, list the tables in the FROM clause and 
omit the join condition from the WHERE clause. 


Figure 4-15 How to use cross joins 
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How to work with unions 


Like a join, a union combines data from two or more tables. Instead of 
combining columns from base tables, however, a union combines rows from 
two or more result sets. You'll see how that works in the topics that follow. 


The syntax of a union 


Figure 4-16 shows how to code a union. As the syntax shows, you create a 
union by connecting two or more SELECT statements with the UNION key- 
word. For this to work, the result of each SELECT statement must have the 
same number of columns, and the data types of the corresponding columns in 
each table must be compatible. 

In this syntax, we have indented all of the SELECT statements that are 
connected by the UNION operator to make it easier to see how this statement 
works. However, in a production environment, it’s common to see the SELECT 
statements and the UNION operator coded at the same level of indentation. 

If you want to sort the result of a union operation, you can code an ORDER 
BY clause after the last SELECT statement. Note that the column names you 
use in this clause must be the same as those used in the first SELECT statement. 
That’s because the column names you use in the first SELECT statement are the 
ones that are used in the result set. 

By default, a union operation removes duplicate rows from the result set. If 
that’s not what you want, you can include the ALL keyword. In most cases, 
though, you’ll omit this keyword. 


Unions that combine data from different tables 


The example in this figure shows how to use a union to combine data from 
two different tables. In this case, the Active_Invoices table contains invoices 
with outstanding balances, and the Paid_Invoices table contains invoices that 
have been paid in full. Both of these tables have the same structure as the 
Invoices table you’ve seen in previous figures. 

This union operation combines the rows in both tables that have an invoice 
date on or after June 1, 2008. Notice that the first SELECT statement includes a 
column named Source that contains the literal value “Active.” The second 
SELECT statement includes a column by the same name, but it contains the 
literal value “Paid.” This column is used to indicate which table each row in the 
result set came from. 

Although this column is assigned the same name in both SELECT state- 
ments, you should realize that doesn’t have to be the case. In fact, none of the 
columns have to have the same names. Corresponding columns do have to have 
compatible data types. But the corresponding relationships are determined by 
the order in which the columns are coded in the SELECT clauses, not by their 
names. When you use column aliases, though, you’ll typically assign the same 
name to corresponding columns so that the statement is easier to understand. 


Chapter4 How to retrieve data from two or more tables 


The syntax for a union operation 


SELECT statement_1 
UNION [ALL] 

SELECT statement 2 
[UNION [ALL] 

SELECT statement 3]... 
[ORDER BY order_by list] 


A union that combines invoice data from two different tables 


SELECT 'Active' AS source, invoice number, invoice date, invoice_total 
FROM active invoices 
WHERE invoice _ date >= '01-JUN-2008' 

UNION 
SELECT 'Paid' AS source, invoice number, invoice date, invoice total 
FROM paid invoices 
WHERE invoice date >= '01-JUN-2008' 

ORDER BY invoice total DESC 


The result set 


SOURCE INYOICE_NUMBER |INYOICE_DATE INYOICE_TOTAL 
1 Active 40318 -J 21842 
2 Paid PO2-3772 -J 7125.34 
3 Paid 10843 mJ 4901,26 
4 Paid 77290 1750 


5 Pald RTR-72-3662-X 1600 
6 Pald 7§C-90227 1367.5 
7 Pald PO2-88D77S7 856.92 
8 Active 77271-001 562 
9 Active 2982771 503.2 


(22 rows selected) 


Description 


A union combines the result sets of two or more SELECT statements into one result set. 


Each result set must return the same number of columns, and the corresponding columns 
in each result set must have compatible data types. 

By default, a union eliminates duplicate rows. If you want to include duplicate rows, 
code the ALL keyword. 

The column names in the final result set are taken from the first SELECT clause. Col- 
umn aliases assigned by the other SELECT clauses have no effect on the final result set. 
To sort the rows in the final result set, code an ORDER BY clause after the last SELECT 


statement. This clause must refer to the column names assigned in the first SELECT 
clause. 


Figure 4-16 How to combine data from different tables 
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Unions that combine data from the same table 


Figure 4-17 shows how to use unions to combine data from a single table. 
In the first example, rows from the Invoices table that have a balance due are 
combined with rows from the same table that are paid in full. As in the example 
in the previous figure, a column named Source is added at the beginning of each 
interim table. That way, the final result set indicates whether each invoice is 
active or paid. 

The second example in this figure shows how you can use a union with data 
that’s joined from two tables. Here, each SELECT statement joins data from the 
Invoices and Vendors tables. The first SELECT statement retrieves invoices with 
totals greater than $10,000. Then, it calculates a payment of 33% of the invoice 
total. The two other SELECT statements are similar. The second one retrieves 
invoices with totals between $500 and $10,000 and calculates a 50% payment. 
And the third one retrieves invoices with totals less than $500 and sets the 
payment amount at 100% of the total. Although this is unrealistic, it helps 
illustrate the flexibility of union operations. 

Notice in this example that the same column aliases are assigned in each 
SELECT statement. Although the aliases in the second and third SELECT 
statements have no effect on the query, they make the query easier to read. In 
particular, it makes it easy to see that the three SELECT statements have the 
same number and types of columns. 
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A union that combines information from the Invoices table 


SELECT 'Active' AS source, invoice number, invoice date, invoice total 
FROM invoices 
WHERE (invoice total - payment_total - credit _total) > 0 

UNION 
SELECT 'Paid' AS source, invoice number, invoice date, invoice total 
FROM invoices 
WHERE (invoice total - payment_total - credit total) <= 0 

ORDER BY invoice total DESC 


The result set 
source | INVOICE NUMBER |INVoIcE_paTe | INVoICE_ToTAL 
1 Pald 0-2058 08-MAY-08 37986.19 
2 Pald p-0258 16-APR-08 268614 


3 Pald 0-2060 08-M4 -06 23517.56 
4 Active 40318 18-JUL-08 21842 
5 Active P-0608 11-APR-08 20551.18 
6 Active 0-2436 O7-MaY-08 10976.06 


(114 rows selected) 


A union that combines payment data from the same joined tables 


SELECT invoice_number, vendor_name, '33% Payment' AS payment_type, 
invoice_total AS total, (invoice_total * 0.333) AS payment 
FROM invoices JOIN vendors 
ON invoices.vendor_id = vendors.vendor_id 
WHERE invoice total > 10000 
UNION 
SELECT invoice number, vendor_name, '50% Payment’ AS payment _type, 
invoice total AS total, (invoice total * 0.5) AS payment 
FROM invoices JOIN vendors 
ON invoices.vendor_id = vendors.vendor_id 
WHERE invoice total BETWEEN 500 AND 10000 
UNION 
SELECT invoice number, vendor_name, 'Full amount’ AS payment_type, 
invoice_total AS Total, invoice_total AS Payment 
FROM invoices JOIN vendors 
ON invoices.vendor_id = vendors.vendor_id 
WHERE invoice total < 500 
ORDER BY payment_type, vendor _name, invoice_number 


The result set 


I) INVOICE_NUMBER | VENDOR_NAME PAYMENT_TYPE |W 
1 40316 Data Reproductlons Corp 33% Payment 21842 7273.388 
2 0-2058 Malloy Lithographing Inc 33% Payment 37988.19 12642.74127 
3 0-2080 Malloy Lithographing Inc 33% Payment 23517.58 7831.35414 


4 0.2436 Malloy Lithographing Inc 33% Payment 10978.08 3855.027986 
5 P-0259 Malloy Lithographing Inc 33% Payment 26881 4 8951.5062 
6 P-O608 Malloy Lithographing Inc 33% Payment 20551.18  6843.54294 
7 509786 Bertelsmann Industry Svcs. Inc 50% Payment 6940.25 3470.125 


(114 rows selected) 


Figure 4-17 Unions that combine data from the same table 
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How to use the MINUS and INTERSECT operators 


Like the UNION operator, the MINUS and INTERSECT operators work 
with two or more result sets, as shown in figure 4-18. Because of that, all three 
of these operators can be referred to as set operators. In addition, the MINUS 
and INTERSECT operators follow many of the same rules as the UNION 
operator. 

The first query shown in this figure uses the MINUS operator to return the 
first and last names of all customers in the Customers table except any custom- 
ers whose first and last names also exist in the Employees table. Since Thomas 
Hardy is the only name that’s the same in both tables, this is the only record 
that’s excluded from the result set for the query that comes before the MINUS 
operator. 

The second query shown in this figure uses the INTERSECT operator to 
return the first and last names of all customers in the Customers table whose 
first and last names also exist in the Employees table. Since Thomas Hardy is 
the only name that exists in both tables, this is the only record that’s returned for 
the result set for this query. 

When you use the MINUS and INTERSECT operators, you must follow 
many of the same rules for working with the UNION operator. To start, both of 
the statements that are connected by these operators must return the same 
number of columns. In addition, the data types for these columns must be 
compatible. Finally, when two queries are joined by a MINUS or INTERSECT 
operator, the column names in the final result set are taken from the first query. 
When you code the ORDER BY clause, you can specify the column names in 
the first query (customer_last_name in our example), or you can specify the 
column position (2). If you understand how the UNION operator works, you 
shouldn’t have any trouble understanding these rules. 
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The syntax for the MINUS and INTERSECT operations 


SELECT statement_1 
{MINUS | INTERSECT} 

SELECT statement 2 
[ORDER BY order_by list] 


The Customers table The Employees table 


CUSTOMER FIRST NAME |E CUSTOMER _LAST_MAME 


Ane Truilla 

Antonio Moreno Simonian 
Thomas: Hardy Ollvia Hernandez 
Christina Berglund Robert Aaronsen 


Andars 


Moos Watson 


(24 rows selected) (9 rows selected) 


A query that excludes rows from the first query if they also occur in the 
second query 


SELECT customer_first_name, customer_last_name 
FROM customers 
MINUS 
SELECT first_name, last_name 
FROM employees 
ORDER BY customer_last_name 


The result set 
CUSTOMER_FIRST_NAME CUSTOMER_LAST_NAME 
1 Maria Anders 
2 Christina Berglund 


3 årt Braunschwelger 
4 Donna Chelan 


(23 rows selected) 


A query that only includes rows that occur in both queries 


SELECT customer first name, customer_last_name 
FROM customers 
INTERSECT 
SELECT first_name, last_name 
FROM employees 


The result set 


I) CUSTOMER_FIRST_NAME | CUSTOMER_LAST_NAME 
1 Thomas Hardy 


(1 rows selected) 


Description 


The number of columns must be the same in both SELECT statements. 
The data types for each column must be compatible. 
The column names in the final result set are taken from the first SELECT statement. 


Figure 4-18 | Howto use the MINUS and INTERSECT operators 
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Perspective 


In this chapter, you learned a variety of techniques for combining data from 
two or more tables into a single result set. In particular, you learned how to use the 
SQL-92 syntax for combining data using inner joins. Of all the techniques pre- 
sented in this chapter, this is the one you’ll use most often. So you’ll want to be 
sure you understand it thoroughly before you go on. 


Terms 


join 

join condition 
inner join 

ad hoc relationship 
qualified column name 
explicit syntax 
table alias 

schema 

user 

schema owner 
self-join 

implicit syntax 


Exercises 


theta syntax 
outer join 

left outer join 
right outer join 
full outer join 
equijoin 
natural join 
cross join 
Cartesian product 
union 

set operator 


1. Write a SELECT statement that returns all columns from the Vendors table 
inner-joined with all columns from the Invoices table. 


2. Write a SELECT statement that returns four columns: 


vendor_name 
invoice_number 
invoice_date 


balance_due 


vendor_name from the Vendors table 
invoice_number from the Invoices table 
invoice_date from the Invoices table 


invoice_total minus payment_total minus credit_total from 
the Invoices table 


The result set should have one row for each invoice with a non-zero balance. 
Sort the result set by vendor_name in ascending order. 


3. Write a SELECT statement that returns three columns: 


vendor_name 
default_account 
description 


vendor_name from the Vendors table 
default_account_number from the Vendors table 


account_description from the General_Ledger_Accounts 
table 


The result set should have one row for each vendor, and it should be sorted by 
account_description and then by vendor_name. 
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Write a SELECT statement that returns five columns from three tables: 
vendor_name vendor_name from the Vendors table 

invoice_date invoice_date from the Invoices table 
invoice_number  invoice_number from the Invoices table 

li_sequence invoice_sequence from the Invoice_Line_Items table 
li_amount line_item_amt from the Invoice_Line_Items table 


Use these aliases for the tables: Ven for the Vendors table, Inv for the Invoices 
table, and LI for the Invoice_Line_Items table. Also, sort the final result set by 
vendor_name, invoice_date, invoice_number, and invoice_sequence. 


Write a SELECT statement that returns three columns: 


vendor_id vendor_id from the Vendors table 
vendor_name vendor_name from the Vendors table 
contact_name A concatenation of vendor_contact_first_name and 


vendor_contact_last_name with a space in between 


The result set should have one row for each vendor whose contact has the same 
last name as another vendor’s contact, and it should be sorted by 
vendor_contact_last_name. Hint: Use a self-join. 


Write a SELECT statement that returns two columns from the 
General_Ledger_Accounts table: account_number and account_description. The 
result set should have one row for each account number that has never been 
used. Sort the final result set by account_number. Hint: Use an outer join to the 
Invoice_Line_lItems table. 


Use the UNION operator to generate a result set consisting of two columns from 
the Vendors table: vendor_name and vendor_state. If the vendor is in California, 
the vendor_state value should be “CA”; otherwise, the VendorState value should 
be “Outside CA.” Sort the final result set by vendor_name. 
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How to code summary 
queries 


In this chapter, you’ll learn how to code queries that summarize data. For 
example, you can use summary queries to report sales totals by vendor or state, 
or to get a count of the number of invoices that were processed each day of the 
month. You’ll also learn how to use a special type of function called an 
aggregate function. Aggregate functions allow you to do jobs like calculate 
averages, summarize totals, or find the highest value for a given column. 


How to work with aggregate functions ...........s:sssesssesee 160 
How:to'codeiaggregate functions iniuriis utes 160 
Queries that use aggregate functions .......sssssserssreersssserasescssterssesetssversarees 162 
How to group and summarize data .....ss.nsnsssnnnunnnnnnnnnnnnnnnnn 164 
How to code the GROUP BY and HAVING clauses ...sssesssessorssereorssrrseresss 164 
Queries that use the GROUP BY and HAVING clause ..........cccsscsscscsseess 166 
How the HAVING clause compares to the WHERE clause ...........ccsccssees 168 
How to code complex search conditions ......sessesssrssssessersessosrsnnsereersunsones 170 
How to summarize data using Oracle extensions .......... 172 
How to use the ROLLUP operator .........ssserserscsssssesscsesscenseneensssssrsnesnveneas 172 
Howstoise the GUBE operators m iana A aeeasa ts cots ee oases 174 
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How to work with aggregate functions 


In chapter 3, you were introduced to scalar functions, which operate on a 
single value and return a single value. In this chapter, you’ll learn how to use 
aggregate functions, which operate on a series of values and return a single 
summary value. Because aggregate functions typically operate on the values in 
columns, they are sometimes referred to as column functions. A query that 
contains one or more aggregate functions is typically referred to as a summary 


query. 
How to code aggregate functions 


Figure 5-1 presents the syntax of the most common aggregate functions. 
Since the purpose of these functions is self-explanatory, we’ll focus mainly on 
how you use them. 

All of these functions but one operate on an expression. In the query in this 
figure, for example, the expression that’s coded for the SUM function calculates 
the balance due of an invoice using the invoice_total, payment_total, and 
credit_total columns. The result is a single value that represents the total amount 
due for all the selected invoices. If you look at the WHERE clause in this 
example, you’ll see that it includes only those invoices with a balance due. 

In addition to an expression, you can also code the ALL or DISTINCT 
keyword in these functions. ALL is the default, which means that all values are 
included in the calculation. The exceptions are null values, which are always 
excluded from these functions. 

If you don’t want duplicate values included, you can code the DISTINCT 
keyword. In most cases, you’ll use DISTINCT only with the COUNT function. 
You’ll see an example of that in the next figure. You won’t use it with MIN or 
MAX because it has no effect on those functions. And it doesn’t usually make 
sense to use it with the AVG and SUM functions. 

Unlike the other aggregate functions, you can’t use the ALL or DISTINCT 
keywords or an expression with COUNT(*). Instead, you code this function 
exactly as shown in the syntax. The value returned by this function is the 
number of rows in the base table that satisfy the search condition of the query, 
including rows with null values. The COUNT(*) function in the query in this 
figure, for example, indicates that the Invoices table contains 40 invoices with a 
balance due. 
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The syntax of the aggregate functions 
Function syntax Result 


AVG([ALL|DISTINCT] expression) The average of the non-null values in the expression. 
SUM([ALL|DISTINCT] expression) The total of the non-null values in the expression. 


MIN([ALL|DISTINCT] expression) The lowest non-null value in the expression. 
MAX([ALL|DISTINCT] expression) The highest non-null value in the expression. 
COUNT ( [ALL | DISTINCT] expression) The number of non-null values in the expression. 
COUNT (*) The number of rows selected by the query. 


A summary query that counts unpaid invoices and calculates the total due 


SELECT COUNT(*) AS number of invoices, 

SUM(invoice total - payment total - credit_total) AS total due 
FROM invoices 
WHERE invoice total - payment_total - credit _total > 0 


i NUMBER_OF INVOICES |] TOTAL_DUE 
1 40 66798.24 


Description 

e Aggregate functions, also called column functions, perform a calculation on the values in 
a set of selected rows. You specify the values to be used in the calculation by coding an 
expression for the function’s argument. In many cases, the expression is just the name of 
a column. 

e A SELECT statement that includes an aggregate function can be called a summary 
query. 

e The expression you specify for the AVG and SUM functions must result in a numeric 
value. The expression for the MIN, MAX, and COUNT functions can result in a nu- 
meric, date, or string value. 

e By default, all values are included in the calculation regardless of whether they’re 
duplicated. If you want to omit duplicate values, code the DISTINCT keyword. This 
keyword is typically used only with the COUNT function. 

e All of the aggregate functions except for COUNT(*) ignore null values. 

e Aggregate functions are often used with the GROUP BY clause of the SELECT state- 
ment, which is used to group the rows in a result set. See figure 5-3 for more informa- 
tion. 

e If you code an aggregate function in the SELECT clause, that clause can’t include non- 
aggregate columns from the base table. 


Figure 5-1 How to code aggregate functions 
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Queries that use aggregate functions 


Figure 5-2 presents four more queries that use aggregate functions. Before 
we describe these queries, you should know that with two exceptions, a SE- 
LECT clause that contains an aggregate function can contain only aggregate 
functions. The first exception is if the column specification results in a literal 
value. This is illustrated by the first column in the first two queries in this 
figure. The second exception is if the query includes a GROUP BY clause. 
Then, the SELECT clause can include any columns specified in the GROUP BY 
clause. You’ll see how you use the GROUP BY clause later in this chapter. 

The first two queries in this figure use the COUNT(*) function to count the 
number of rows in the Invoices table that satisfy the search condition. In both 
cases, only those invoices with invoice dates after 1/1/2008 are included in the 
count. In addition, the first query uses the AVG function to calculate the average 
amount of those invoices and the SUM function to calculate the total amount of 
those invoices. In contrast, the second query uses the MIN and MAX functions 
to calculate the minimum and maximum invoice amounts. 

Although the MIN, MAX, and COUNT functions are typically used on 
columns that contain numeric data, they can also be used on columns that 
contain character or date data. In the third query, for example, they’re used on 
the vendor_name column in the Vendors table. Here, the MIN function returns 
the name of the vendor that’s lowest in the sort sequence, the MAX function 
returns the name of the vendor that’s highest in the sort sequence, and the 
COUNT function returns the total number of vendors. Note that since the 
vendor_name column can’t contain null values, the COUNT(*) function would 
have returned the same result. 

The fourth query illustrates how using the DISTINCT keyword can affect 
the result of a COUNT function. Here, the first COUNT function uses the 
DISTINCT keyword to count the number of vendors that have invoices dated 
1/1/2008 or later in the Invoices table. To do that, it looks for distinct values in 
the vendor_id column. In contrast, because the second COUNT function doesn’t 
include the DISTINCT keyword, it counts every invoice that’s dated 1/1/2008 or 
later. Of course, you could accomplish the same thing using the COUNT(*) 
function. COUNT (vendor_id) is used here only to illustrate the difference 
between coding and not coding the DISTINCT keyword. 
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A summary query that uses the COUNT(*), AVG, and SUM functions 


SELECT 'After 1/1/2008' AS selection date, 
COUNT (*) AS number_of invoices, 
ROUND (AVG (invoice total), 2) AS avg_invoice amt, 
SUM(invoice_total) AS total_invoice_amt 

FROM invoices 

WHERE invoice date > '01-JAN-2008! 


SELECTION _DATE F NUMBER OF INVOICES a AMG _INYOICE_AMT a TOTAL_INVOICE_AMT 


1 After 14 2008 114 1879.74 214290.51 


A summary query that uses the MIN and MAX functions 


SELECT 'After 1/1/2008' AS selection date, COUNT(*) AS number of invoices, 
MAX (invoice total) AS highest invoice total, 
MIN(invoice total) AS lowest invoice total 

FROM invoices 

WHERE invoice date > '01-JAN-2008' 


SELECTION_DATE NUMBER _OF INVOICES HIGHEST _INVOICE TOTAL LOWEST INVOICE TOTAL 


1 After 11/2008 37966.19 


A summary query that works on non-numeric columns 


SELECT MIN(vendor_name) AS first vendor, 
MAX (vendor_name) AS last_vendor, 
COUNT (vendor name) AS number_of vendors 
FROM vendors 


FIRST_VENDOR |] LAST_VENDOR | NUMBER_OF_VENDORS 


1 ASC Signs Zylka Design 


A summary query that uses the DISTINCT keyword 


SELECT COUNT (DISTINCT vendor_id) AS number_of vendors, 
COUNT (vendor_id) AS number of invoices, 
ROUND (AVG (invoice_total),2) AS avg _invoice_amt, 
SuM(invoice_ total) AS total_invoice_amt 

FROM invoices 

WHERE invoice date > '01-JAN-2008' 


NUMBER_OF_VENDORS |A NUMBER _OF INVOICES |E] AYG INVOICE AMT TOTAL_INVOICE_AMT 
1 a4 144 1879.74 21420054 = 


Notes 


e Ifyou want to count all of the selected rows, you’ll typically use the COUNT(*) function 
as illustrated by the first two examples above. An alternative is to code the name of any 


column in the base table that can’t contain null values, as illustrated by the third ex- 
ample. 


e If you want to count only the rows with unique values in a specified column, you can 
code the COUNT function with the DISTINCT keyword followed by the name of the 
column, as illustrated in the fourth example. 


Figure 5-2 Queries that use aggregate functions 
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How to group and summarize data 


Now that you understand how aggregate functions work, you’re ready to 
learn how to group data and use aggregate functions to summarize the data in 
each group. To do that, you need to learn about two new clauses of the SELECT 
statement: GROUP BY and HAVING. 


How to code the GROUP BY and HAVING clauses 


Figure 5-3 presents the syntax of the SELECT statement with the GROUP 
BY and HAVING clauses. The GROUP BY clause determines how the selected 
rows are grouped, and the HAVING clause determines which groups are in- 
cluded in the final results. As you can see, these clauses are coded after the 
WHERE clause but before the ORDER BY clause. That makes sense because 
the search condition in the WHERE clause is applied before the rows are 
grouped, and the sort sequence in the ORDER BY clause is applied after the 
rows are grouped. 

In the GROUP BY clause, you list one or more columns or expressions 
separated by commas. Then, the rows that satisfy the search condition in the 
WHERE clause are grouped by those columns or expressions in ascending 
sequence. That means that a single row is returned for each unique set of values 
in the GROUP BY columns. This will make more sense when you see the 
examples in the next figure that group by two columns. For now, take a look at 
the example in this figure that groups by a single column. 

This example calculates the average invoice amount for each vendor who 
has invoices in the Invoices table that average over $2,000. To do that, it groups 
the invoices by vendor_id. Then, the AVG function calculates the average of the 
invoice_total column. Because this query includes a GROUP BY clause, this 
function calculates the average invoice total for each group rather than for the 
entire result set. In that case, the aggregate function is called a vector aggregate. 
In contrast, aggregate functions like the ones you saw earlier in this chapter that 
return a single value for all the rows in a result set are called scalar aggregates. 

The example in this figure also includes a HAVING clause. The search 
condition in this clause specifies that only those vendors with invoices that 
average over $2,000 should be included. Note that this condition must be 
applied after the rows are grouped and the average for each group has been 
calculated. 

In addition to the AVG function, the SELECT clause includes the vendor_id 
column, That makes sense since the rows are grouped by this column. However, 
the columns used in the GROUP BY clause don’t have to be included in the 
SELECT clause. 
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The syntax of the SELECT statement with the GROUP BY and HAVING 
clauses 


SELECT select_list 

FROM table_source 

[WHERE search condition] 
[GROUP BY group_by_ list] 
[HAVING search condition] 
[ORDER BY order _by list] 


A summary query that calculates the average invoice amount by vendor 


SELECT vendor_id, ROUND(AVG(invoice total), 2) AS average_invoice_amount 
FROM invoices 

GROUP BY vendor id 

HAVING AVG(invoice total) > 2000 

ORDER BY average _invoice_amount DESC 


VEMDOR_[D | aAVERAGE_INVOICE_AMOLINT 
110 23979.48 a 
72 10953.66 
104 7125.34 
39 6940.25 
419 4901.26 
122 2575.33 
86 2433 
100 21845 


on On > ù y 


(8 rows selected) 


Description 


e The GROUP BY clause groups the rows of a result set based on one or more columns or 
expressions. It’s typically used in SELECT statements that include aggregate functions. 

e Ifyou include aggregate functions in the SELECT clause, the aggregate is calculated for 
each set of values that result from the columns named in the GROUP BY clause. 

e If you include two or more columns or expressions in the GROUP BY clause, they form 
a hierarchy where each column or expression is subordinate to the previous one. 

e When a SELECT statement includes a GROUP BY clause, the SELECT clause can 
include aggregate functions, the columns used for grouping, and expressions that result 
in a constant value. 

e A group-by list typically consists of the names of one or more columns separated by 
commas. However, it can contain any expression except for those that contain aggregate 
functions. 

e The HAVING clause specifies a search condition for a group or an aggregate. This 


condition is applied after the rows that satisfy the search condition in the WHERE clause 
are grouped. 


Figure 5-3 How to code the GROUP BY and HAVING clauses 
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Queries that use the GROUP BY and HAVING 
clauses 


Figure 5-4 presents three more queries that group data. If you understood 
the query in the last figure, you shouldn’t have any trouble understanding how 
the first query in this figure works. It groups the rows in the Invoices table by 
vendor_id and returns a count of the number of invoices for each vendor. 

The second query in this figure illustrates how you can group by more than 
one column. Here, a join is used to combine the vendor_state and vendor_city 
columns from the Vendors table with a count and average of the invoices in the 
Invoices table. Because the rows are grouped by both state and city, a row is 
returned for each state and city combination. Then, the ORDER BY clause sorts 
the rows by city within state. Without this clause, the rows would be returned in 
no particular sequence. 

The third query is identical to the second query except that it includes a 
HAVING clause. This clause uses the COUNT function to limit the state and 
city groups that are included in the result set to those that have two or more 
invoices. In other words, it excludes groups that have only one invoice. 
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A summary query that counts the number of invoices by vendor 


SELECT vendor_id, COUNT(*) AS invoice qty 
FROM invoices 

GROUP BY vendor_id 

ORDER BY vendor_id 


D venpor_p lE invoice_aty 


(34 rows selected) 


A summary query that calculates the number of invoices and the average 
invoice amount for the vendors in each state and city 


SELECT vendor_state, vendor_city, COUNT(*) AS invoice qty, 
ROUND (AVG (invoice total),2) AS invoice_avg 
FROM invoices JOIN vendors 
ON invoices.vendor_id = vendors.vendor_id 
GROUP BY vendor_state, vendor city 
ORDER BY vendor_state, vendor _ city 
VENDOR_STATE ||) VENDOR_cITY | INVOICE_aTY |H INVOICE_AVG 
Phoenix 862 


Fresno 1208.75 
Los Angeles 503.2 
Oxnard 188 


(20 rows selected) 


A summary query that limits the groups to those with two or more 


invoices 


SELECT vendor state, vendor city, COUNT(*) AS invoice qty, 
ROUND (AVG (invoice _total),2) AS invoice_avg 

FROM invoices JOIN vendors 
ON invoices.vendor_id = vendors.vendor_id 

GROUP BY vendor state, vendor city 

HAVING COUNT(*) >= 2 

ORDER BY vendor state, vendor city 


YENDOR_STATE VENDOR_CITY INYVOICE_QTY INYVOICE_AvG 
Fresno 1208.75 


Oxnard 166 
Pasadena 196.12 
Sacramento 253 


(12 rows selected) 


Note 
e You can use a join with a summary query to group and summarize the data in two or 
more tables. 


Figure 5-4 Queries that use the GROUP BY and HAVING clauses 
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How the HAVING clause compares to the WHERE 
clause 


AS you’ve seen, you can limit the groups included in a result set by coding a 
search condition in the HAVING clause. In addition, you can apply a search 
condition to each row before it’s included in a group. To do that, you code the 
search condition in the WHERE clause just as you would for any SELECT 
statement. To make sure you understand the differences between search condi- 
tions coded in the HAVING and WHERE clauses, figure 5-5 presents two 
examples. 

In the first example, the invoices in the Invoices table are grouped by vendor 
name, and a count and average invoice amount are calculated for each group. 
Then, the HAVING clause limits the groups in the result set to those that have 
an average invoice total greater than $500. 

In contrast, the second example includes a search condition in the WHERE 
Clause that limits the invoices included in the groups to those that have an 
invoice total greater than $500. In other words, the search condition in this 
example is applied to every row. In the previous example, it was applied to each 
group of rows. 

Beyond this, there are also two differences in the expressions that you can 
include in the WHERE and HAVING clauses. First, the HAVING clause can 
include aggregate functions as you saw in the first example in this figure, but 
the WHERE clause can’t. That’s because the search condition in a WHERE 
clause is applied before the rows are grouped. Second, although the WHERE 
clause can refer to any column in the base tables, the HAVING clause can only 
refer to columns included in the SELECT clause. That’s because it filters the 
summarized result set that’s defined by the SELECT, FROM, WHERE, and 
GROUP BY clauses. In other words, it doesn’t filter the base tables. 
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A summary query with a search condition in the HAVING clause 


SELECT vendor name, COUNT(*) AS invoice qty, 
ROUND (AVG (invoice _total),2) AS invoice_avg 
FROM vendors JOIN invoices 
ON vendors.vendor_id = invoices.vendor_id 
GROUP BY vendor _name 
HAVING AVG(invoice total) > 500 
ORDER BY invoice qty DESC 


a YENDOR NAME j INVOICE_AYG 
1 Unted Parcel Service 2575.33 


2 Zylkea Dasign 667.53 
3 Malloy Lithographing Inc 23978.48 
4 IBM 600.06 


(19 rows selected) 


A summary query with a search condition in the WHERE clause 


SELECT vendor_name, COUNT(*) AS invoice_qty, 
ROUND (AVG (invoice_total),2) AS invoice_avg 
FROM vendors JOIN invoices 
ON vendors.vendor_id = invoices.vendor_id 
WHERE invoice_total > 500 
GROUP BY vendor_name 
ORDER BY invoice_qty DESC 


VENDOR_NAME INVOICE_@TY | INVOICE_avG 
1 United Parcel Service 2875.33 
2 Zylka Design 946.67 


3 Malloy Lthographing Inc 23978.46 
4 Ingram 1077.21 


(20 rows selected) 


Description 


When you include a WHERE clause in a SELECT statement that uses grouping and 
aggregates, the search condition is applied before the rows are grouped and the aggre- 
gates are calculated. That way, only the rows that satisfy the search condition are 
grouped and summarized. 

When you include a HAVING clause in a SELECT statement that uses grouping and 
aggregates, the search condition is applied after the rows are grouped and the aggregates 
are calculated. That way, only the groups that satisfy the search condition are included in 
the result set. 

A HAVING clause can only refer to a column included in the SELECT clause. A 
WHERE clause can refer to any column in the base tables. 

Aggregate functions can only be coded in the HAVING clause. A WHERE clause can’t 
contain aggregate functions. 


Figure 5-5 How the HAVING clause compares to the WHERE clause 


169 


170 Section2 The essential SQL skills 


How to code complex search conditions 


You can code compound search conditions in a HAVING clause just as you 
can in a WHERE clause. This is illustrated by the first query in figure 5-6. This 
query groups invoices by invoice date and calculates a count of the invoices and 
the sum of the invoice totals for each date. In addition, the HAVING clause 
specifies three conditions. First, the invoice date must be between 5/1/2008 and 
5/31/2008. Second, the invoice count must be greater than 1. And third, the sum 
of the invoice totals must be greater than $100. 

Because the second and third conditions in the HAVING clause in this 
query include aggregate functions, they must be coded in the HAVING clause. 
The first condition, however, doesn’t include an aggregate function, so it could 
be coded in either the HAVING or WHERE clause. The second statement in this 
figure, for example, shows this condition coded in the WHERE clause. Note 
that the query returns the same result set regardless of where you code this 
condition. 

So how do you know where to code a search condition? In general, I think 
your code will be easier to read if you include all the search conditions in the 
HAVING clause. If, on the other hand, you prefer to code non-aggregate search 
conditions in the WHERE clause, that’s OK, too. 

Since a search condition in the WHERE clause is applied before the rows 
are grouped while a search condition in the HAVING clause isn’t applied until 
after the grouping, you might expect a performance advantage by coding all 
search conditions in the HAVING clause. However, Oracle takes care of this 
performance issue for you when it optimizes the query. To do that, it automati- 
cally moves search conditions to whichever clause will result in the best perfor- 
mance, as long as that doesn’t change the logic of your query. As a result, you 
can code search conditions wherever they result in the most readable code 
without worrying about system performance. 


Chapter5 How to code summary queries 171 


A summary query with a compound condition in the HAVING clause 


SELECT 
invoice_date, 
COUNT (*) AS invoice qty, 
SUM(invoice_ total) AS invoice_sum 
FROM invoices 
GROUP BY invoice date 
HAVING invoice date BETWEEN '01-MAY-2008' AND '31-MAY-2008' 
AND COUNT(*) > 1 
AND SUM(invoice total) > 100 
ORDER BY invoice date DESC 


The same query coded with a WHERE clause 


SELECT 
invoice date, 
COUNT (*) AS invoice_qty, 
SUM(invoice total) AS invoice sum 
FROM invoices 
WHERE invoice date BETWEEN '01-MAY-2008' AND '31-MAY-2008' 
GROUP BY invoice date 
HAVING COUNT(*) > 1 
AND SUM(invoice_total) > 100 
ORDER BY invoice date DESC 


The result set returned by both queries 
D nvoce pate mvorce_ory | voice_sum 


1 31-May-08 3 11557.75 
z 23-W4Y-08 2761.17 


4 20-MAY-08 306.64 


Aia 
5 E 
3 22-MAY-08 2 4425 | 
3 I+] 


{15 rows selected) 


Description 


e ‘You can use the AND and OR operators to code compound search conditions in a 
HAVING clause just as you can in a WHERE clause. 


e Ifa search condition includes an aggregate function, it must be coded in the HAVING 
clause. Otherwise, it can be coded in either the HAVING or the WHERE clause. 
e In most cases, your code will be easier to read if you code all the search conditions in the 


HAVING clause, but you can code non-aggregate search conditions in the WHERE 
clause if you prefer. 


Figure 5-6 How to code complex search conditions 
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How to summarize data using Oracle 
extensions 


So far, this chapter has discussed standard SQL keywords and functions. 
However, you should also know about two extensions that Oracle provides for 
summarizing data: the ROLLUP and CUBE operators. You'll learn how to use 
these operators in the topics that follow. 


How to use the ROLLUP operator 


You can use the ROLLUP operator to add one or more summary rows to a 
result set that uses grouping and aggregates. The two examples in figure 5-7 
illustrate how this works. 

The first example shows how the ROLLUP operator works when you group 
by a single column. Here, the invoices in the Invoices table are grouped by 
vendor_id, and an invoice count and invoice total are calculated for each vendor. 
In addition, because the ROLLUP operator is included in the GROUP BY 
clause, a summary row is added at the end of the result set. This row summa- 
rizes all the aggregate columns in the result set. In this case, it summarizes the 
invoice_count and invoice_total columns. Because the vendor_id column can’t 
be summarized, it’s assigned a null value. 

The second query in this figure shows how the ROLLUP operator works 
when you group by two columns. This query groups the vendors in the Vendors 
table by state and city and counts the number of vendors in each group. Then, in 
addition to a summary row at the end of the result set, summary rows are 
included for each state. 

When you use an ORDER BY clause with the ROLLUP operator, the rows 
are sorted after the summary rows are added. Then, because null values come 
after other values in the Oracle sort sequence, the summary rows come after the 
rows that they summarize. Incidentally, if the ORDER BY clause in the second 
example were omitted, the only difference in the result set would be that the 
cities within each state wouldn’t necessarily be in the right sequence. 

You can also use another function, the GROUPING function, to work with 
null columns in a summary row. However, this function is typically used in 
conjunction with the CASE function, which you’ll learn about in chapter 8. So 
we'll present the GROUPING function in that chapter. 
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A summary query that includes a final summary row 


SELECT vendor_id, COUNT(*) AS invoice count, 
SUM(invoice_total) AS invoice_total 

FROM invoices 

GROUP BY ROLLUP (vendor_id) 


VENDOR_[D |® mvoice_counr |P INVOICE_TOTAL 
1200.12 


564 


856.92 
21927.31 


(35 rows selected) 


A summary query that includes a summary row for each grouping level 


SELECT vendor_state, vendor city, COUNT (*) AS qty vendors 
FROM vendors 
WHERE vendor state IN ('IA', 'NdJ') 
GROUP BY ROLLUP (vendor_state, vendor city) 
ORDER BY vendor_state, vendor city 
YENDOR_STATE ¥ENDOR_CITY OTY_VENDORS 
Fairfield 
Washington 
{null} 


East Brunswick 
Falrfleld 
Washington 
(null) 

(null} 


Description 


You can use the ROLLUP operator in the GROUP BY clause to add summary rows to 
the final result set. A summary is provided for each aggregate column included in the 
select list. All other columns, except the ones that identify which group is being summa- 
rized, are assigned null values. 

The ROLLUP operator adds a summary row for each group specified in the GROUP BY 
clause except for the rightmost group, which is summarized by the aggregate functions. 
It also adds a summary row to the end of the result set that summarizes the entire result 
set. If the GROUP BY clause specifies a single group, only the final summary row is 
added. 


The sort sequence in the ORDER BY clause is applied after the summary rows are 
added. 

When you use the ROLLUP operator, you can’t use the DISTINCT keyword in any of 
the aggregate functions. 

You can also use the GROUPING function with the ROLLUP operator to determine if a 
summary row has a null value assigned to a given column. See chapter 8 for details. 


Figure 5-7 How to use the ROLLUP operator 
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How to use the CUBE operator 


Figure 5-8 shows you how to use the CUBE operator. This operator is 
similar to the ROLLUP operator, except that it adds summary rows for every 
combination of groups. This is illustrated by the two examples in this figure. As 
you can see, these examples are the same as the ones in figure 5-7 except that 
they use the CUBE operator instead of the ROLLUP operator. 

In the first example, the result set is grouped by a single column. In this 
case, a single row is added to the result set that summarizes all the groups. In 
other words, this works the same as it does with the ROLLUP operator. The 
only difference is that the summary row is at the start of the result set instead of 
the end of the result set. 

In the second example, you can see how CUBE differs from ROLLUP when 
you group by two or more columns. In this case, the result set includes a sum- 
mary row for each state just as it did when the ROLLUP operator was used. For 
instance, the third row in this example indicates that there are two vendors in the 
state of Iowa. In addition, though, the CUBE operator includes a summary row 
for each city. For instance, the ninth row in this example indicates that there are 
two vendors for the city of Fairfield. But if you look at the first and fifth rows in 
the result set, you'll see that one of those vendors is in Fairfield, Iowa and one is 
in Fairfield, New Jersey. Similarly, there are two vendors in two different states 
for the city named Washington. There are also two vendors in the city of East 
Brunswick, but both are in New Jersey. 

Here again, the ORDER BY clause is applied after the summary rows are 
added. So in the second example, the ORDER BY clause has two effects. First, 
it insures that the cities are in sequence within the states. Second, it moves the 
four extra summary rows that are created by the CUBE operator from the start 
of the result set to the end of the result set. 

Now that you’ve seen how the CUBE operator works, you may be wonder- 
ing when you would use it. One obvious use is to add a summary row to a result 
set that’s grouped by a single column, but you could just as easily use the 
ROLLUP operator for that. Beyond that, the CUBE operator can occasionally 
get useful information that you can’t get any other way. 
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A summary query that includes a final summary row 


SELECT vendor_id, COUNT(*) AS invoice count, 
SUM(invoice_total) AS invoice_total 

FROM invoices 

GROUP BY CUBE (vendor_id) 


INYOICE_TOTAL 
214290.51 
1200.12 


564 
856.92 


(35 rows selected) 


A summary query that includes a summary row for each set of groups 


SELECT vendor state, vendor city, COUNT(*) AS qty vendors 

FROM vendors 

WHERE vendor state IN ('IA', 'NdJ') 

GROUP BY CUBE(vendor state, vendor city) 

ORDER BY vendor state, vendor city 

VENDOR _STATE ¥YENDOR_CITY EJ QTY_YENDORS 

Fairfield 
Washington 
(null) 
East Brunswick 
Fairfield 
Washington 
(null) 
East Brunswick 
Fairfield 


Washington 
{null} 


aon NNA.. ON ONO - 


Description 


You can use the CUBE operator in the GROUP BY clause to add summary rows to the 
final result set. A summary is provided for each aggregate column included in the select 
list. All other columns, except the ones that identify which group is being summarized, 
are assigned null values. 

The CUBE operator adds a summary row for every combination of groups specified in 
the GROUP BY clause. It also adds a summary row to the end of the result set that 
summarizes the entire result set. 


The sort sequence in the ORDER BY clause is applied after the summary rows are 
added. 


When you use the CUBE operator, you can’t use the DISTINCT keyword in any of the 
aggregate functions. 

You can also use the GROUPING function with the CUBE operator to determine if a 
summary row has a null value assigned to a given column. See chapter 8 for details. 


Figure 5-8 How to use the CUBE operator 
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Perspective 


In this chapter, you learned how to code queries that group and summarize 
data. In most cases, you’ll be able to use the techniques presented here to get the 
summary information you need. 


Terms 


scalar function 
aggregate function 
column function 


summary query 
scalar aggregate 


vector aggregate 


Exercises 
1. Write a SELECT statement that returns one row for each vendor that contains 
these columns from the Invoices table: 
The vendor_id column 
The sum of the invoice_total column 
The result set should be sorted by vendor_id. 


2. Write a SELECT statement that returns one row for each vendor that contains 
these columns: 


The vendor_name column from the Vendors table 
The sum of the payment_total column in the Invoices table. 


The result set should be sorted in descending sequence by the payment 
total sum for each vendor. 


3. Write a SELECT statement that returns one row for each vendor that contains 
three columns: 


The vendor_name column from the Vendors table 

The count of the invoices for each vendor in the Invoices table 

The sum of the invoice_total column for each vendor in the Invoices table 
Sort the result set so the vendor with the most invoices appears first. 


4. Write a SELECT statement that returns one row for each general ledger 
account number that contains three columns: 


The account_description column from the General_Ledger_Accounts table 


The count of the entries in the Invoice_Line_ Items table that have the same 
account_number 
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The sum of the line item amounts in the Invoice_Line_Items table that have 
the same account-number 


Filter the result set to include only those rows with a count greater than 1; group 
the result set by account description; and sort the result set in descending 
sequence by the sum of the line item amounts. 


Modify the solution to exercise 4 to filter for invoices dated in the second quarter 
of 2008 (April 1, 2008 to June 30, 2008). Hint: Join to the Invoices table to code 
a search condition based on invoice_date. 


Write a SELECT statement that answers this question: What is the total amount 
invoiced for each general ledger account number? Use the ROLLUP operator to 
include a row that gives the grand total. Hint: Use the line_item_amt column of 
the Invoice_Line_Items table. 


Write a SELECT statement that answers this question: Which vendors are being 
paid from more than one account’? Return two columns: the vendor name and the 
total number of accounts that apply to that vendor’s invoices. Hint: Use the 
DISTINCT keyword to count the account_number column in the 
Invoice_Line_Items table. 
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How to code subqueries 


A subquery is a SELECT statement that’s coded within another SQL statement. 
As a result, you can use subqueries to build queries that would be difficult or 
impossible to do otherwise. In this chapter, you’ll learn how to use subqueries 
within SELECT statements. Then, in the next chapter, you’ll learn how to use 
them when you code INSERT, UPDATE, and DELETE statements. 
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POG E a S. 208 


180 


Section2 The essential SQL skills 


An introduction to subqueries 


Since you know how to code SELECT statements, you already know how to 
code a subquery. It’s simply a SELECT statement that’s coded within another 
SQL statement. The trick to using subqueries is knowing where and when to use 
them. You’ll learn the specifics of using subqueries throughout this chapter. The 
two topics that follow, however, will give you an overview of where and when 
to use them. 


In figure 6-1, you can see that a subquery can be coded, or introduced, in 
the WHERE, HAVING, FROM, or SELECT clause of a SELECT statement. 
The SELECT statement in this figure, for example, illustrates how you can use a 
subquery in the search condition of a WHERE clause. When it’s used in a 
search condition, a subquery can be referred to as a subquery search condition 
or a subquery predicate. 

The statement in this figure retrieves all the invoices from the Invoices table 
that have invoice totals greater than the average of all the invoices. To do that, 
the subquery calculates the average of all the invoices. Then, the search condi- 
tion tests each invoice to see if its invoice total is greater than that average. 

When a subquery returns a single value as it does in this example, you can 
use it anywhere you would normally use an expression. However, a subquery 
can also return a single-column result set with two or more rows. In that case, it 
can be used in place of a list of values, such as the list for an IN operator. In 
addition, if a subquery is coded within a FROM clause, it can return a result set 
with two or more columns. You’!l learn about all of these different types of 
subqueries in this chapter. 

You can also code a subquery within another subquery. In that case, the 
subqueries are said to be nested. Because nested subqueries can be difficult to 
read, you should use them only when necessary. 
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Four ways to introduce a subquery in a SELECT statement 
1. Ina WHERE clause as a search condition 

2. Ina HAVING clause as a search condition 

3. Inthe FROM clause as a table specification 

4. Inthe SELECT clause as a column specification 


A SELECT statement that uses a subquery in the WHERE clause 


SELECT invoice number, invoice date, invoice total 
FROM invoices 
WHERE invoice total > 
(SELECT AVG(invoice_ total) 
FROM invoices) 
ORDER BY invoice_total 


The value returned by the subquery 
1879.7413 


The result set 


D invoice_NUMBER |) mvoce_pate |) INvoICE_TOTAL 
4 98931 9-487 48-APR-08 4927.54 
2 971522 30-APR-08 4962.13 


3 989319-417 26-APR-08 2051.59 
4 989319-427 25-4PR-08 2115.81 
5 989319-477 19-APR-08 2184.11 


(21 rows) 


Description 

e A subquery is a SELECT statement that’s coded within another SQL statement. 

e A subquery can return a single value, a result set that contains a single column, or a 
result set that contains one or more columns. 


e A subquery that returns a single value can be coded, or introduced, anywhere an expres- 
sion is allowed. A subquery that returns a single column can be introduced in place of a 
list of values, such as the values for an IN phrase. And a subquery that returns one or 
more columns can be introduced in place of a table in the FROM clause. 

e The syntax for a subquery is the same as for a standard SELECT statement. However, a 
subquery doesn’t typically include the GROUP BY or HAVING clause, and it can’t 
include an ORDER BY clause. 

e A subquery that’s used ina WHERE or HAVING clause is called a subquery search 
condition or a subquery predicate. This is the most common use for a subquery. 

e Although you can introduce a subquery in a GROUP BY or ORDER BY clause, you 
usually won’t need to. 

e Subqueries can be nested within other subqueries. However, subqueries that are nested 
more than two or three levels deep can be difficult to read. 


Figure 6-1 How to use subqueries 
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How subqueries compare to joins 


In the last figure, you saw an example of a subquery that returns an aggre- 
gate value that’s used in the search condition of a WHERE clause. This type of 
subquery provides for processing that can’t be done any other way. However, 
most subqueries can be restated as joins, and most joins can be restated as 
subqueries. This is illustrated by the SELECT statements in figure 6-2. 

Both SELECT statements in this figure return a result set that consists of 
selected rows and columns from the Invoices table. In this case, only the in- 
voices for vendors in California are returned. The first statement uses a join to 
combine the Vendors and Invoices tables so the vendor_state column can be 
tested for each invoice. In contrast, the second statement uses a subquery to 
return a result set that consists of the vendor_id column for each vendor in 
California. Then, that result set is used with the IN operator in the search 
condition so that only invoices with a vendor_id in that result set are included in 
the final result set. 

So if you have a choice, which technique should you use? In general, we 
recommend that you use the technique that results in the most readable code. 
For example, a join tends to be more intuitive than a subquery when it uses an 
existing relationship between two tables. That’s the case with the Vendors and 
Invoices tables used in the examples in this figure. On the other hand, a 
subquery tends to be more intuitive when it uses an ad hoc relationship. 

As your queries get more complex, you may find that they’re easier to code 
by using subqueries, regardless of the relationships that are involved. Later in 
this chapter, for example, you’ll see a couple examples of this. 

You should also realize that when you use a subquery in a search condition, 
its results can’t be included in the final result set. For instance, the second 
example in this figure can’t be changed to include the vendor_name column 
from the Vendors table. That’s because the Vendors table isn’t named in the 
FROM clause of the outer query. So if you need to include information from 
both tables in the result set, you need to use a join. 
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A query that uses an inner join 


SELECT invoice number, invoice date, invoice total 
FROM invoices JOIN vendors 

ON invoices.vendor_id = vendors.vendor id 
WHERE vendor_state = 'CA’' 
ORDER BY invoice date 


The same query restated with a subquery 


SELECT invoice number, invoice date, invoice total 
FROM invoices 
WHERE vendor_id IN 
(SELECT vendor_id 
FROM vendors 
WHERE vendor state = 'CA') 
ORDER BY invoice date 


The result set returned by both queries 


I INVOICE_NUMBER | INVOICE_DaTE || INVOICE_TOTAL 
1 QPsee72 25-FEB-08 118.54 
2 Q545443 14-MAR-08 1083.58 


3 MABO1489 16-APR-08 936.93 


How to code subqueries 


4 97/5535 26-APR-08 313.55 


(40 rows) 


Advantages of joins 


e The result of a join operation can include columns from both tables. 


e A join tends to be more intuitive when it uses an existing relationship between the two 


tables, such as a primary key to foreign key relationship. 


Advantages of subqueries 


e ‘You can use a subquery to pass an aggregate value to the outer query. 


e A subquery tends to be more intuitive when it uses an ad hoc relationship between the 


two tables. 


e Long, complex queries can sometimes be easier to code using subqueries. 


Description 


e Like a join, a subquery can be used to code queries that work with two or more tables. 
e Most subqueries can be restated as joins and most joins can be restated as subqueries. 


Figure 6-2 How subqueries compare to joins 
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How to code subqueries in search 
conditions 


You can use a variety of techniques to work with a subquery in a search 
condition. You’ll learn about those techniques in the topics that follow. As you 
read these topics, keep in mind that although all of the examples illustrate the 
use of subqueries in a WHERE clause, all of this information applies to the 
HAVING clause as well. 


How to use subqueries with the IN operator 


In chapter 3, you learned how to use the IN operator to test whether an 
expression is contained in a list of values. One way to provide that list of values 
is to use a subquery. This is illustrated in figure 6-3. 

The example in this figure retrieves the vendors from the Vendors table that 
don’t have invoices in the Invoices table. To do that, it uses a subquery to 
retrieve the vendor_id of each vendor in the Invoices table. The result is a result 
set like the one shown that contains just the vendor_id column. Then, this result 
set is used to filter the vendors that are included in the final result set. 

Note that this subquery returns a single column. That’s a requirement when 
a subquery is used with the IN operator. Note also that the subquery includes the 
DISTINCT keyword. That way, if more than one invoice exists for a vendor, the 
vendor_id for that vendor will be included only once. The keyword DISTINCT 
is optional, however. The final result set will be the same if you include it or 
omit it. 

In the previous figure, you saw that a query that uses a subquery with the IN 
operator can be restated using an inner join. Similarly, a query that uses a 
subquery with the NOT IN operator can typically be restated using an outer 
join. The first query shown in this figure, for example, can be restated as shown 
in the second query. In this case, though, the query with the subquery is more 
readable. 
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The syntax of a WHERE clause that uses an IN phrase with a subquery 
WHERE test _expression [NOT] IN (subquery) 


A query that returns vendors without invoices 


SELECT vendor_id, vendor_name, vendor_state 
FROM vendors 
WHERE vendor_id NOT IN 
(SELECT DISTINCT vendor_id 
FROM invoices) 
ORDER BY vendor_id 


The result of the subquery 


VENDOR D 


(34 rows) 


The result set 


a VENDOR JD 4) VENDOR_NAME VENDOR_STATE 
33 Nelson 
35 Cal Stata Termite 
36 Graylitt 
38 Venture Communications Int'l 


39 Custom Printing Company 
40 Net Assoc of College Stores 


(88 rows) 


The query restated without a subquery 


SELECT v.vendor_id, vendor _name, vendor state 
FROM vendors v LEFT JOIN invoices i 
ON v.vendor_id = i.vendor_id 
WHERE i.vendor_ id IS NULL 
ORDER BY v.vendor_id 


Description 


e You can introduce a subquery with the IN operator to provide the list of values that are 
tested against the test expression. 


e When you use the IN operator, the subquery must return a single column of values. 


e A query that uses the NOT IN operator with a subquery can typically be restated using 
an outer join. 


Figure 6-3 How to use subqueries with the IN operator 
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How to compare the result of a subquery with an 
expression 


Figure 6-4 illustrates how you can use the comparison operators to compare 
an expression with the result of a subquery. In the example, the subquery returns 
the average balance due of the invoices in the Invoices table with a balance due 
greater than zero. Then, it uses that value to retrieve all invoices that have a 
balance due that’s less than the average. 

When you use a comparison operator as shown in this figure, the subquery 
must return a single value. In most cases, that means that it uses an aggregate 
function. However, you can also use the comparison operators with subqueries 
that return two or more values. To do that, you use the SOME, ANY, or ALL 
keyword to modify the comparison operator. You’ll learn more about these 
keywords in the next two topics. 
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The syntax of a WHERE clause that compares an expression with the 
value returned by a subquery 
WHERE expression comparison_operator [SOME|ANY|ALL] (subquery) 


A query that returns invoices with a balance due less than the average 


SELECT inveice_number, invoice date, 

invoice_total - payment_total - credit_total AS balance due 
FROM invoices 
WHERE invoice_total - payment_total - credit_total > 0 

AND invoice total - payment_total - credit total < 


( 


SELECT AvG(invoice total - payment_total - credit total) 
FROM invoices 
WHERE invoice total - payment_total - credit _total > 0 


) 


ORDER BY invoice total DESC 


The value returned by the subquery 


1669.906 


The result set 


INVOICE_NUMBER |A INvoICE_paTE | saLance_ pue 


1 31359763 
z 97/553 

3 77271-001 
4 31361833 
5 9982771 

6 97/5535 


23-MAY -08 1575 a 
27-APR-08 804.14 E 
OS-JLIN-08 682 
23-MAY -08 579.42 
03-JLIN-08 503.2 
26-APR-08 313.55 =| 


(33 rows) 


Description 


e ‘You can use a comparison operator in a search condition to compare an expression with 
the results of a subquery., 


e If you code a search condition without the ANY, SOME, or ALL keyword, the subquery 
must return a single value. 


e If you include the ANY, SOME, or ALL keyword, the subquery can return a list of 
values. See figures 6-5 and 6-6 for more information on using these keywords. 


Figure 6-4 


How to compare the result of a subquery with an expression 
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How to use the ALL keyword 


Figure 6-5 shows you how to use the ALL keyword. This keyword modifies 
the comparison operator so the condition must be true for all the values returned 
by asubquery. This is equivalent to coding a series of conditions connected by 
AND operators. The table at the top of this figure describes how this works for 
some of the comparison operators. 

If you use the greater than operator (>), the expression must be greater than 
the maximum value returned by the subquery. Conversely, if you use the less 
than operator (<), the expression must be less than the minimum value returned 
by the subquery. If you use the equal operator (=), the expression must be equal 
to all of the values returned by the subquery. And if you use the not equal 
operator (<>), the expression must not equal any of the values returned by the 
subquery. Note that a not equal condition could be restated using a NOT IN 
condition. 

The query in this figure illustrates the use of the greater than operator with 
the ALL keyword. Here, the subquery selects the invoice_total column for all 
the invoices with a vendor_id value of 34. This results in a table with two rows, 
as shown in this figure. Then, the outer query retrieves the rows from the 
Invoices table that have invoice totals greater than all of the values returned by 
the subquery. In other words, this query returns all the invoices that have totals 
greater than the largest invoice for vendor number 34. 

When you use the ALL operator, you should realize that if the subquery 
doesn’t return any rows, the comparison operation will always be true. In 
contrast, if the subquery returns only null values, the comparison operation will 
always be false. 

In many cases, a condition with the ALL keyword can be rewritten so it’s 
easier to read and maintain. For example, the condition in the query in this 
figure could be rewritten to use the MAX function like this: 


WHERE invoice total > 
(SELECT MAX(invoice_ total) 
FROM invoices 
WHERE vendor_id = 34) 
Whenever you can, then, we recommend that you replace the ALL keyword 
with an equivalent condition. 
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How the ALL keyword works 


x > ALL (1, 2) x > 2 x must be greater than all the values returned by 
the subquery, which means it must be greater 
than the maximum value. 


x < ALL (1, 2) x<1 x must be less than all the values returned by the 
subquery, which means it must be less than the 
minimum value. 

ALL (1, 2) (x = 1) AND (x = 2) This condition can evaluate to True only if the 
subquery returns a single value or if all the 
values returned by the subquery are the same. 
Otherwise, it evaluates to False. 


x <> ALL (1, 2) (x <> 1) AND (x <> 2) This condition is equivalent to: 
x NOT IN (1, 2) 


A query that returns invoices larger than the largest invoice for vendor 34 


SELECT vendor name, invoice_number, invoice_total 
FROM invoices i JOIN vendors v ON i.vendor_id = v.vendor_id 
WHERE invoice total > ALL 
(SELECT invoice_total 
FROM invoices 
WHERE vendor _id = 34) 
ORDER BY vendor_name 


The result of the subquery 


INVOICE TOTAL 
1 118.54 
2 1083.58 


an 


The result set 


YENDOR_NAME INYOICE_NUMBER |È INYOICE_TOTAL 

1 Bertelsmann Industry Svcs, Inc 509786 6940.25 

2 Cahners Publishing Company 587056 2184.5 

3 Computerworld 367447 2433 

4 Deta Reproductions Corp 40316 21842 
(25 rows) 
Description 


e You can use the ALL keyword to test that a comparison condition is true for all of the 
values returned by a subquery. This keyword is typically used with the comparison 
operators <, >, <=, and >=. 


e Ifno rows are returned by the subquery, a comparison that uses the ALL keyword is 
always true. 


e Ifall of the rows returned by the subquery contain a null value, a comparison that uses 
the ALL keyword is always false. 


Figure 6-5 How to use the ALL keyword 
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How to use the ANY and SOME keywords 


Figure 6-6 shows how to use the ANY and SOME keywords. You use these 
keywords to test if a comparison is true for any, or some, of the values returned 
by a subquery. This is equivalent to coding a series of conditions connected with 
OR operators. Because these keywords are equivalent, you can use whichever 
one you prefer. The table at the top of this figure describes how these keywords 
work with some of the comparison operators. 

The example in this figure shows how you can use the ANY keyword with 
the less than operator. This statement is similar to the one you saw in the 
previous figure, except that it retrieves invoices with invoice totals that are less 
than at least one of the invoice totals for a given vendor. Like the statement in 
the previous figure, this condition could be rewritten using the MAX function, 
as follows: 


WHERE invoice_total «< 
(SELECT MAX (invoice total) 
FROM invoices 
WHERE vendor id = 115) 
Because you can usually replace an ANY condition with an equivalent condition 


that’s more readable, you probably won’t use ANY often. 
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How the ANY and SOME keywords work 


x > ANY (1, 2) x > 1 x must be greater than at least one of the values 
returned by the subquery list, which means that it 
must be greater than the minimum value returned 
by the subquery. 

x < ANY (1, 2) x <2 x must be less than at least one of the values 
returned by the subquery list, which means that it 
must be less than the maximum value returned 


by the subquery. 
x = ANY (1, 2) (x = 1) OR (x = 2) This condition is equivalent to: x IN (1, 2) 
x <> ANY (1, 2) (x <> 1) OR (x <> 2) This condition will evaluate to True for any non- 


empty result set containing at least one non-null 
value that isn’t equal to x. 


A query that returns invoice amounts smaller than the largest invoice 
amount for vendor 115 


SELECT vendor_name, invoice number, invoice_total 
FROM vendors JOIN invoices ON vendors.vendor_id = invoices.invoice id 
WHERE invoice_total < ANY 

(SELECT invoice total 

FROM invoices 

WHERE vendor _id = 115) 


The result of the subquery 


INVOICE_TOTAL 


The result set 


my YENDOR NAME 
1 Boucher Communications ne 248063706 
2 Relter's Sclertific & Pro Books 25022117 


3 Champlon Printing Company 247680512 
4 Opamp Technical Books 21-4748363 
5 Capital Resource Credit 21-4923721 


(17 rows) 


Description 


e You can use the ANY or SOME keyword to test that a condition is true for one or more 
of the values returned by a subquery. ANY and SOME are equivalent keywords. SOME 
is the ANSI-standard keyword, but ANY is more commonly used. 


e Ifno rows are returned by the subquery or all of the rows returned by the subquery 
contain a null value, a comparison that uses the ANY or SOME keyword is always false. 


Figure 6-6 How to use the ANY and SOME keywords 
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How to code correlated subqueries 


The subqueries you’ve seen so far in this chapter have been subqueries that 
are executed only once for the entire query. However, you can also code 
subqueries that are executed once for each row that’s processed by the outer 
query. This type of query is called a correlated subquery, and it’s similar to 
using a loop to do repetitive processing in a procedural programming language. 

Figure 6-7 illustrates how correlated subqueries work. The example in this 
figure retrieves rows from the Invoices table for those invoices that have an 
invoice total that’s greater than the average of all the invoices for the same 
vendor. To do that, the search condition in the WHERE clause of the subquery 
refers to the vendor_id value of the current invoice. That way, only the invoices 
for the current vendor will be included in the average. 

Each time a row in the outer query is processed, the value in the vendor_id 
column for that row is substituted for the column reference in the subquery. 
Then, the subquery is executed based on the current value. If the vendor_id 
value is 95, for example, this subquery will be executed: 

SELECT AVG(invoice_total) 

FROM invoices inv_sub 

WHERE inv_sub.vendor id = 95 

After this subquery is executed, the value it returns is used to determine 
whether the current invoice is included in the result set. For example, the value 
returned by the subquery for vendor 95 is 28.5016. Then, that value is compared 
with the invoice total of the current invoice. If the invoice total is greater than 
that value, the invoice is included in the result set. Otherwise, it’s not. This 
process is repeated until each of the invoices in the Invoices table has been 
processed. 

As you study this example, notice how the column names in the WHERE 
clause of the inner query are qualified to indicate whether they refer to a column 
in the inner query or the outer query. In this case, the same table is used in both 
the inner and outer queries, so aliases have been assigned to the tables. Then, 
those alias names are used to qualify the column names. Although you have to 
qualify a reference to a column in the outer query, you don’t have to qualify a 
reference to a column in the inner query. However, it’s common practice to 
qualify both names, particularly if they refer to the same table. 

Because correlated subqueries can be difficult to code, you may want to test 
a subquery separately before using it within another SELECT statement. To do 
that, however, you’ll need to substitute a constant value for the variable that 
refers to a column in the outer query. That’s what we did to get the average 
invoice total for vendor 95. Once you’re sure that the subquery works on its 
own, you can replace the constant value with a reference to the outer query so 
you can use it within a SELECT statement. 
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A query that uses a correlated subquery to return each invoice amount 
that’s higher than the vendor’s average invoice amount 


SELECT vendor id, invoice number, invoice total 
FROM invoices inv_main 
WHERE invoice total > 

(SELECT AVG(invoice_ total) 

FROM invoices inv_sub 

WHERE inv_sub.vendor_id = inv_main.vendor_ id) 
ORDER BY vendor id, invoice total 


The value returned by the subquery for vendor 95 
28.50166... 


The result set 


I VENDOR D |} INVOICE_NUMBER |Ý INVOICE_TOTAL 
83 31359783 1575 
95 111-92R-10095 32,7 


95 111-92R-10093 39,77 
95 111-82R-10092 46.21 
110 P-0259 268814 


(36 rows) 


Description 


e A correlated subquery is a subquery that is executed once for each row processed by the 
outer query. In contrast, a noncorrelated subquery is executed only once. All of the 
subqueries you’ve seen so far have been noncorrelated subqueries. 


e A correlated subquery refers to a value that’s provided by a column in the outer query. 
Because that value varies depending on the row that’s being processed, each execution of 
the subquery returns a different result. 

e To refer to a value in the outer query, a correlated subquery uses a qualified column 
name that includes the table name from the outer query. If the subquery uses the same 
table as the outer query, you must assign a table alias to one of the tables to remove 
ambiguity. 


Note 


e Because a correlated subquery is executed once for each row processed by the outer 
query, a query with a correlated subquery typically takes longer to run than a query with 
a noncorrelated subquery. 


Figure 6-7 How to code correlated subqueries 
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How to use the EXISTS operator 


Figure 6-8 shows you how to use the EXISTS operator with a subquery. 
This operator tests whether or not the subquery returns a result set. In other 
words, it tests whether the result set exists. When you use this operator, the 
subquery doesn’t actually return a result set to the outer query. Instead, it returns 
an indication of whether any rows satisfy the search condition of the subquery. 
Because of that, queries that use this operator execute quickly. 

You typically use the EXISTS operator with a correlated subquery, as 
illustrated in this figure. This query retrieves all the vendors in the Vendors table 
that don’t have invoices in the Invoices table. Notice that this query returns the 
same vendors as the two queries you saw in figure 6-3 that use the IN operator 
with a subquery and an outer join. However, the query in this figure executes 
more quickly than either of the queries in figure 6-3. 

In this example, the correlated subquery selects all invoices that have the 
same vendor_id value as the current vendor in the outer query. Because the 
subquery doesn’t actually return a result set, it doesn’t matter what columns are 
included in the SELECT clause. So it’s customary to just code an asterisk. 

After the subquery is executed, the search condition in the WHERE clause 
of the outer query uses NOT EXISTS to test whether any invoices were found 
for the current vendor. If not, the vendor row is included in the result set. 
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The syntax of a subquery that uses the EXISTS operator 
WHERE [NOT] EXISTS (subquery) 


A query that returns vendors without invoices 


SELECT vendor_id, vendor_name, vendor state 
FROM vendors 
WHERE NOT EXISTS 
(SELECT * 
FROM invoices 
WHERE invoices.vendor_id = vendors.vendor_id) 


The result set 


VENDOR_ID VENDOR_NAME VENDOR_STATE 
33 Nielson 
35 Cal State Termite 
36 Graylitt 


36 Venture Communications Int'l 
39 Custom Printing Company 
40 Net Assoc of Collage Stores 


(88 rows) 


Description 


e You can use the EXISTS operator to test that one or more rows are returned by the 
subquery. You can also use the NOT operator along with the EXISTS operator to test that 
no rows are returned by the subquery. 


e When you use the EXISTS operator with a subquery, the subquery doesn’t actually 
return any rows. Instead, it returns an indication of whether any rows meet the specified 
condition. 


e Because no rows are returned by the subquery, it doesn’t matter what columns you 
specify in the SELECT clause. So you typically just code an asterisk (*). 


e Although you can use the EXISTS operator with either a correlated or a noncorrelated 
subquery, it’s used most often with correlated subqueries. That’s because it’s usually 
better to use a join than a noncorrelated subquery with EXISTS. 


Figure 6-8 How to use the EXISTS operator 
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Other ways to use subqueries 


Although you’ll typically use subqueries in the WHERE or HAVING clause 
of a SELECT statement, you can also use them in the FROM and SELECT 
clauses. You’ll learn how to do that in the topics that follow. 


How to code subqueries in the FROM clause 


Figure 6-9 shows you how to code a subquery in a FROM clause. As you 
can see, you can code a subquery in place of a table specification. In this 
example, the results of the subquery are joined with another table. When you 
use a subquery in this way, it can return any number of rows and columns. In 
the Oracle documentation, this type of subquery is sometimes referred to as an 
inline view since it works like a view that’s temporarily created and stored in 
memory. 

Subqueries are typically used in the FROM clause to create inline views that 
provide summarized data to a summary query. The subquery in this figure, for 
example, creates an inline view that contains the vendor_id values and the 
average invoice totals for all vendors with invoice averages over 4900. To do 
that, it selects all vendors with invoice averages over 4900, and it groups the 
invoices by vendor_id. The inline view is then joined with the Invoices table, 
and the resulting rows are grouped by vendor_id. Finally, the maximum invoice 
date and average invoice total are calculated for the grouped rows, and the 
results are sorted by the maximum invoice date in descending sequence. 

You should notice three things about this query. First, the inline view is 
assigned a table alias so it can be referred to from the outer query. Second, the 
result of the AVG function in the subquery is assigned a column alias. This is 
because an inline view can’t have unnamed columns. Third, although you might 
think that you could use the average invoice totals calculated by the subquery in 
the select list of the outer query, you can’t. That’s because the outer query 
includes a GROUP BY clause, so only aggregate functions, columns named in 
the GROUP BY clause, and constant values can be included in this list. Because 
of that, the AVG function is repeated in the select list. 

When used in the FROM clause, a subquery is similar to a view. As you 
learned in chapter 1, a view is a predefined SELECT statement that’s saved with 
the database. Because it’s saved with the database, a view typically performs 
more efficiently than an inline view. However, it isn’t always practical to use a 
view. In those cases, inline views can be quite useful. In addition, inline views 
can be useful for testing possible solutions before creating a view. Then, once 
the inline view works the way you want it to, you can define the view based on 
the subquery you used to create the inline view. 
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A query that uses an inline view to retrieve all vendors with an average 
invoice total over 4900 


SELECT i.vendor_ id, MAX(invoice date) AS last invoice date, 
AVG(invoice_total) AS average_invoice_total 
FROM invoices i JOIN 
( 
SELECT vendor id, AvG(invoice total) AS average_invoice total 
FROM invoices 
HAVING AvG(invoice_ total) > 4900 
GROUP BY vendor_id 
) v 
ON i.vendor_id = v.vendor_id 
GROUP BY i.vendor_id 
ORDER BY MAX (invoice date) DESC 


The result of the subquery (an inline view) 


¥ENDOR_ID AVERAGE_INYOICE_TOTAL 
10963.655 
4901 26 


23978,482 
6940.25 
7125.34 


The result set 


H venvor_o lA Lastnvoice_pate | AvERAGE_INVOICE_TOTAL 
72 18-JUL-08 40963.655 
119 04-JUN-08 4901.26 


104 03-JUN-08 7125.34 
99 31-MAY-08 6940.25 
110 08-MAY-08 23976.482 


Description 
e A subquery that’s coded in the FROM clause returns a result set that can be referred to as 


an inline view. When you create an inline view, you must assign an alias to it. Then, you 
can use the inline view within the outer query just as you would any other table. 


e When you code a subquery in the FROM clause, you must assign names to any calcu- 
lated values in the result set. 

e Inline views are most useful when you need to further summarize the results of a sum- 
mary query. 

e An inline view is like a view in that it retrieves selected rows and columns from one or 
more base tables. Because views are stored as part of the database, they’re typically more 
efficient to use than inline views. However, it may not always be practical to construct 
and save a view in advance. 


Figure 6-9 How to code subqueries in the FROM clause 
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How to code subqueries in the SELECT clause 


Figure 6-10 shows you how to use subqueries in the SELECT clause. As 
you can see, you can use a subquery in place of a column specification. Because 
of that, the subquery must return a single value. 

In most cases, the subqueries you use in the SELECT clause will be corre- 
lated subqueries. The subquery in this figure, for example, calculates the 
maximum invoice date for each vendor in the Vendors table. To do that, it refers 
to the vendor_id column from the Invoices table in the outer query. 

Because subqueries coded in the SELECT clause are difficult to read, you 
shouldn’t use them unless you can’t find another solution. In most cases, 
though, you can replace the subquery with a join. The first query shown in this 
figure, for example, could be restated as shown in the second query. This query 
joins the Vendors and Invoices tables, groups the rows by vendor_name, and 
then uses the MAX function to calculate the maximum invoice date for each 
vendor. As you can see, this query is much easier to read than the one with the 
subquery. 
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A query that uses a correlated subquery in its SELECT clause to retrieve 
the most recent invoice for each vendor 


SELECT vendor_name, 

(SELECT MAX(invoice_ date) FROM invoices 

WHERE invoices.vendor id = vendors.vendor_id) AS latest_inv 
FROM vendors 
ORDER BY latest inv 


The result set 


m YENDOR_NA&ME a LATEST_INY 
1 IBM 14-MAR-08 
2 Wang Laboratories, Inc. 16-4PR-08 
3 Retter's Sclertific & Pro Books 17-4PR-08 


4 United Parcel Service 26-4PR-08 
5 Waketleld Co 26-4,PR-08 
6 Zylka Design 01-May-08 
7 Abbey Office Furnishings 02-MaY-08 


(122 rows) 


The same query restated using a join 


SELECT vendor_name, MAX(invoice date) AS latest_inv 
FROM vendors v 

LEFT JOIN invoices i ON v.vendor_id = i.vendor_id 
GROUP BY vendor _name 
ORDER BY latest_inv 


Description 


e When you code a subquery for a column specification in the SELECT clause, the 
subquery must return a single value. 


e A subquery that’s coded within a SELECT clause is typically a correlated subquery. 


e A query that includes a subquery in its SELECT clause can typically be restated using a 
join instead of the subquery. Because a join is usually faster and more readable, 
subqueries are seldom coded in the SELECT clause. 


Figure 6-10 How to code subqueries in the SELECT clause 
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Guidelines for working with complex 
queries 


So far, the examples you’ve seen of queries that use subqueries have been 
relatively simple. However, these types of queries can get complicated in a 
hurry, particularly if the subqueries are nested. Because of that, you’1l want to 
be sure that you plan and test these queries carefully. You’ll learn a procedure 
for doing that in a moment. But first, you’ll see a complex query that illustrates 
the type of query we’re talking about. 


A complex query that uses subqueries 


Figure 6-11 presents a query that uses three subqueries. The first subquery 
is used in the FROM clause of the outer query to create a result set that contains 
the state, name, and total invoice amount for each vendor in the Vendors table. 
The second subquery is also used in the FROM clause of the outer query to 
create a result set that’s joined with the first result set. This result set contains 
the state and total invoice amount for the vendor in each state that has the 
largest invoice total. To create this result set, a third subquery is nested within 
the FROM clause of the subquery. This subquery is identical to the first 
subquery. 

After the two result sets are created, they’re joined based on the columns in 
each table that contain the state and the total invoice amount. The final result set 
includes the state, name, and total invoice amount for the vendor in each state 
with the largest invoice total. This result set is sorted by state. 

As you can see, this query is quite complicated and difficult to understand. 
In fact, you might be wondering if there isn’t an easier solution to this problem. 
For example, you might think that you could solve the problem simply by 
joining the Vendors and Invoices tables and creating a grouped aggregate. If you 
grouped by vendor state, however, you wouldn’t be able to include the name of 
the vendor in the result set. And if you grouped by vendor state and vendor 
name, the result set would include all the vendors, not just the vendor from each 
state with the largest invoice total, 

If you think about how else you might solve this query, we think you’ll 
agree that the solution presented here is fairly straightforward. However, in 
figure 6-13, you’ll learn one way to simplify this query. In particular, you’ll 
learn how to code a single Summary subquery instead of coding the Summary] 
and Summary2 subqueries shown here. 


A query that uses three subqueries 


SELECT summaryl.vendor state, summaryl.vendor name, 
top_in_ state.sum_ of invoices 


FROM 
( 
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SELECT v_sub.vendor state, v_sub.vendor name, 
SUM(i_sub.invoice total) AS sum_of invoices 


FROM invoices i sub JOIN vendors v_sub 
ON i_sub.vendor_ id = v_sub.vendor id 


GROUP BY v_sub.vendor state, v_sub.vendor name 
) summaryl 


JOIN 
( 


SELECT summary2.vendor state, 


GROUP BY summary2.vendor state 


MAX (summary2.sum_of invoices) AS sum_of invoices 
FROM 


( 


SELECT v_sub.vendor state, v_sub.vendor name, 
SUM(i_sub.invoice total) AS sum_of_invoices 
FROM invoices i sub JOIN vendors v_sub 


ON i_sub.vendor_id = 


v_sub.vendor id 


GROUP BY v_sub.vendor state, v_sub.vendor name 


) summary2 


) top_in_ state 

ON summaryl.vendor state = top_in state.vendor state AND 
summaryl.sum_of invoices = top in state.sum of invoices 
ORDER BY summaryl.vendor state 


The result set 


VENDOR_STATE |] VENDOR_NAME 


J SUM_OF_INVOICES 
Wells Fargo Bank BB2 
Digtal Dreamworks 7125.34 
Retter's Sclartitlc & Pro Books 800 


Dean Witter Reynolds 1367.5 
Malloy Lithographing Inc 119892.41 
United Parcel Service 23177.96 


(10 rows) 


Edward Data Services 207.78 


How the query works 


e This query retrieves the vendor from each state that has the largest invoice total. To do 
that, it uses three subqueries: Summary1, Summary2, and Top_In_State. The Summary1 
and Top_In_State subqueries are joined in the FROM clause of the outer query, and the 
Summary2 subquery is nested within the FROM clause of the Top_In_State subquery. 


e The Summaryl and Summary2 subqueries are identical. They join data from the Vendors 
and Invoices tables and produce a result set that includes the sum of invoices for each 


vendor grouped by vendor name within state. 


e The Top_In_State subquery produces a result set that includes the vendor state and the 
largest sum of invoices for any vendor in that state. This information is retrieved from 
the results of the Summary2 subquery. 


Figure 6-11 


A complex query that uses subqueries 
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A procedure for building complex queries 


To build a complex query like the one in the previous figure, you can use a 
procedure like the one in figure 6-12. To start, you should state the problem to 
be solved so that you’re clear about what you want the query to accomplish. In 
this case, the question is, “Which vendor in each state has the largest invoice 
total?” 

Once you’re clear about the problem, you should outline the query using 
pseudocode. Pseudocode is simply code that represents the intent of the query, 
but doesn’t necessarily use SQL code. The pseudocode shown in this figure, for 
example, uses part SQL code and part English. Notice that this pseudocode 
identifies the two main subqueries. Because these subqueries define inline 
views, the pseudocode also indicates the alias that will be used for each: sum- 
mary1 and top_in_state. That way, you can use these aliases in the pseudocode 
for the outer query to make it clear where the data it uses comes from. 

If it isn’t clear from the pseudocode how each subquery will be coded, or, as 
in this case, if a subquery is nested within another subquery, you can also write 
pseudocode for the subqueries. For example, the pseudocode for the 
top_in_state query is presented in this figure. Because this subquery has a 
subquery nested in its FROM clause, that subquery is identified in this 
pseudocode as Summary2. 

The next step in the procedure is to code and test the actual subqueries to be 
sure they work the way you want them to. For example, the code for the Sum- 
maryl and Summary2 queries is shown in this figure, along with the results of 
these queries and the results of the top_in_state query. Once you’re sure that the 
subqueries work the way you want them to, you can code and test the final 
query. 

If you follow the procedure presented in this figure, you’ll find it easier to 
build complex queries that use subqueries. Before you can use this procedure, of 
course, you need to have a thorough understanding of how subqueries work and 
what they can do. So you’ll want to be sure to experiment with the techniques 
you learned in this chapter before you try to build a complex query like the one 
shown here. 
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A procedure for building complex queries 


1. State the problem to be solved by the query in English. 

2. Use pseudocode to outline the query. The pseudocode should identify the subqueries 
used by the query and the data they return. It should also include aliases used for any 
inline views. 

3. If necessary, use pseudocode to outline each subquery. 

4. Code the subqueries and test them to be sure that they return the correct data. 


5. Code and test the final query. 


The problem to be solved by the query in figure 6-11 
Which vendor in each state has the largest invoice total? 


Pseudocode for the query 


SELECT summaryl.vendor_state, summaryl.vendor_name, 
top in state.sum of invoices 
FROM (inline view returning vendor_state, vendor_name, sum_of invoices) 
AS summaryl 
JOIN (inline view returning vendor_state, max(sum_of invoices) ) 
AS top in state 
ON summaryl.vendor_state = top in state.vendor_state AND 
summaryl.sum of invoices = top in state.sum of invoices 
ORDER BY summaryl.vendor_state 


Pseudocode for the Top_In_State subquery 


SELECT summary2.vendor state, MAX(summary2.sum_ of invoices) 

FROM (inline view returning vendor _state, vendor_name, sum_of_ invoices) 
AS summary2 

GROUP BY summary2.vendor_state 


The code for the Summary1 and Summary2 subqueries 


SELECT v_sub.vendor state, v_sub.vendor name, 
SUM(i_sub.invoice total) AS sum_of_ invoices 
FROM invoices i_sub JOIN vendors v_sub 
ON i_sub.vendor_id = v_sub.vendor_id 
GROUP BY v_sub.vendor state, v_sub.vendor name 
ORDER BY v_sub.vendor_ state, v_sub.vendor_name 


The result of the Summary1 and Summary2 subqueries 


VENDOR_STATE VENDOR_NAME SUM_OF _INYOICES 
Wells Fargo Bank 


Abbey Office Furnishings 
Bertelsmann Industry Svcs. Inc 


(34 rows) 


The result of the Top_In_State subquery 


VENDOR_STATE |A SUM_OF_INVOICES 
1 CA 7125.34 a 
2 MA 1367.5 E 
E 
hd 


3 OH 207.78 


(10 rows) 


Figure 6-12 A procedure for building complex queries 


203 


204 


Section2 The essential SQL skills 


Two more skills for working with 
subqueries 


Now that you’ve learned how to code complex queries, you’re ready to learn 
two more skills that are closely related to coding subqueries. These skills make 
it possible to simplify queries that could also be coded with subqueries. 


How to code a subquery factoring clause 


Subquery factoring allows you to name a block of code that contains a 
SELECT statement. Fox example, figure 6-13 shows how to use subquery 
factoring to simplify the complex query presented in figure 6-11. To start, the 
statement for the query begins with the WITH keyword to identify a subquery 
factoring clause. Then, it specifies Summary as the name for the first subquery, 
followed by the AS keyword, followed by an opening parenthesis, followed by a 
SELECT statement that defines the subquery, followed by a closing parenthesis. 
In this figure, for example, this statement returns the same result set as the 
subqueries named Summary! and Summary2 that were presented in figure 6-11. 

After the first subquery is defined, this example continues by defining a 
second subquery named top_in_state. To start, a comma is coded to separate the 
two subqueries. Then, this code specifies top_in_state as the name for the 
second subquery, followed by the AS keyword, followed by an opening paren- 
thesis, followed by a SELECT statement that defines the subquery, followed by 
a closing parenthesis. Here, this SELECT statement refers to the first subquery, 
named Summary. When coding multiple subqueries within a subquery factoring 
clause, a subquery can refer to any subquery coded before it, but it can’t refer to 
subqueries coded after it. For example, this statement wouldn’t work if the two 
subqueries were coded in the reverse order. 

Finally, the SELECT statement that’s coded immediately after the two 
named subqueries uses both of these subqueries, just as if they were tables. To 
do that, this SELECT statement joins the two subqueries, specifies the columns 
to retrieve, and specifies the sort order. To avoid ambiguous references, each 
column is qualified by the name for the subquery. 

If you compare figure 6-13 with figure 6-11, we think you’ll agree that the 
code in figure 6-13 is easier to read. That’s partly because the tables defined by 
the subqueries aren’t nested within the SELECT statement. In addition, the code 
in figure 6-13 is easier to maintain because the Summary query is coded in one 
place, not in two. 

When using the syntax shown here to define a subquery factoring clause, 
you must supply distinct names for all columns defined by the SELECT state- 
ment, including calculated values. That way, it’s possible for other statements to 
refer to the columns in the result set. Most of the time, that’s all you need to 
know to be able to work with a subquery factoring clause. 
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The syntax of a subquery factoring clause 


WITH query namel AS (query definition1) 
[, query_name2 AS (query definition2)] 
Ee ee ] 

sql_statement 


Two query names and a query that uses them 


WITH summary AS 
( 
SELECT vendor_state, vendor_name, SUM(invoice_total) AS sum_of_invoices 
FROM invoices 
JOIN vendors ON invoices.vendor_id = vendors.vendor_id 
GROUP BY vendor state, vendor_name 
), 
top_in_ state AS 
( 
SELECT vendor state, MAX(sum_of invoices) AS sum_of invoices 
FROM summary 
GROUP BY vendor state 
) 
SELECT summary.vendor state, summary.vendor name, 
top_in state.sum_of invoices 
FROM summary JOIN top in state 
ON summary.vendor state = top in state.vendor state AND 
summary.sum_of invoices = top _in state.sum_ of invoices 
ORDER BY summary.vendor state 


The result set 


VENDOR_STATE YENDOR_NAME SUM_OF_INYOICES 
Wells Fargo Bank 662 
Digital Dreamworks 7125.34 
Reiter's Sciartitic & Pro Books 600 


Dean Vitter Reynolds 1367.5 
Malloy Lithographing Inc 119892.41 
United Parcel Service 23177.96 
Edward Data Services 207.78 


(10 rows) 


Description 


e A subquery factoring clause can be thought of as a named subquery block. This name 
can then be used multiple times in the query. 


e To define a subquery factoring block, you code the WITH keyword followed by the 
definition of the subquery. 


e To code multiple subquery factoring clauses, separate them with commas. Then, each 
clause can refer to itself and any previously defined subquery factoring clauses in the 
same WITH clause. 


e ‘You can use subquery factoring clauses with SELECT, INSERT, UPDATE, and DE- 
LETE statements. However, you’re most likely to use them with SELECT statements, as 
shown in this figure. 


Figure 6-13 How to code a subquery factoring clause 
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How to code a hierarchical query 


A hierarchical query loops through a result set and returns rows in a hierar- 
chical sequence. Hierarchical queries are often used to work with organizational 
charts and other hierarchies, in which a parent element may have one or more 
child elements, and each child element may have one or more child elements. In 
figure 6-14, for example, the query shows the hierarchical levels for the employ- 
ees within a company. 

The Employees table uses the manager_id column to identify the manager 
for each employee. Here, Cindy Smith is the top-level manager since she 
doesn’t have a manager, Elmer Jones and Paulo Locario report to Cindy Smith, 
and so on. 

In this figure, the hierarchical query uses the LEVEL pseudo column to 
return a column that identifies the level of the employee within the hierarchy. To 
start, this query uses the LEVEL pseudo column in the SELECT clause to 
identify the third column. In addition, this query uses the LEVEL pseudo 
column in the ORDER BY clause to sort by this column. 

After the FROM clause, this query uses the START WITH clause to identify 
the row to be used as the root of the hierarchy. In this figure, for example, the 
query uses the employee_id column to identify Cindy Smith as the root of the 
hierarchy. However, if you wanted, you could use a difference expression after 
the START WITH keywords to identify this row or a different row. 

Finally, the CONNECT BY clause specifies the condition that identifies the 
relationship between parent rows and child rows. In this figure, for example, the 
CONNECT BY clause is followed by the PRIOR keyword to specify that a row 
is a child row if the row’s employee_id column equals the manager_id column 
of the other row. 
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A query that returns hierarchical data 


SELECT select list 

FROM table_source 

[WHERE search condition] 

START WITH row_specification 
CONNECT BY PRIOR connect_expression 
[ORDER BY order_by list] 


The Employees table 


@ empcovee_o |) First_wame|f] LastName |) DEPARTMENT _NUMBER |) MANAGER_ID 
1 Cindy Smith 2 Crull) 
2 Elmer Jones 4 4 
3 Ralph Simonian 2 2 
4 Olivia Hernandez 1 E] 
5 Robart Aaronsen 2 4 
6 Danisa Watson 6 8 
7 Thomas Hardy 5 2 
8 Rhea O'Leary 4 E] 
9 Paulo Locarlo 6 1 

A query that returns hierarchical data 
SELECT employee_id, 
first name || ' ' || last_name AS employee name, 


LEVEL 
FROM employees 
START WITH employee_id = 1 
CONNECT BY PRIOR employee id = manager id 
ORDER BY LEVEL, employee_id 


The result set 


EMPLOYEE_ID |) EMPLOYEE NAME |) LEVEL 
1 Cindy Smith 
2 Elmer Jones 
9 Paulo Locario 
3 Ralph Simonian 
4 Ollvia Hernandez 
7 Thomas Hardy 
6 Rhea O'Leary 
5 Robert Aeronsen 
6 Denise Yvetson 


1 
2 
3 
4 
5 
6 
7 
8 
a 


Ee PRU WU UO oO ee 


Description 


A hierarchical query is a query that returns rows in a hierarchical order. 

You can use the LEVEL pseudocolumn to identify the level for each row. 

You can use the START WITH clause to identify the row to be used as the root of the 
hierarchical query. 


You can use the CONNECT BY clause followed by the PRIOR keyword to specify a 
condition that identifies the relationship between parent rows and child rows. 


Figure 6-14 How to code a hierarchical query 
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Perspective 


As you’ve seen in this chapter, subqueries provide a powerful tool for solving 
difficult problems. Before you use a subquery, however, remember that a subquery 
can often be restated more clearly by using a join. If so, you’ll typically want to 
use a join instead of a subquery. 

If you find yourself coding the same subqueries over and over, you should 
consider creating a view for that subquery, as described in chapter 11. This will 
help you develop queries more quickly since you can use the view instead of 
coding the subquery again. In addition, since views typically execute more quickly 
than subqueries, this may improve the performance of your queries. 


Terms 


subquery 

introduce a subquery 
subquery search condition 
subquery predicate 

nested subquery 
correlated subquery 
inline view 

pseudocode 

subquery factoring 
hierarchical query 


Exercises 


1. Write a SELECT statement that returns the same result set as this SELECT 
statement but don’t use a join. Instead, use a subquery in a WHERE clause that 
uses the IN keyword. 


SELECT DISTINCT vendor_name 
FROM vendors JOIN invoices 
ON vendors.vendor id = invoices.vendor_id 
ORDER BY vendor_name 
2. Write a SELECT statement that answers this question: Which invoices have a 
payment_total that’s greater than the average payment_total for all paid 
invoices? Return the invoice_number and invoice_total for each invoice. 


3. Write a SELECT statement that returns two columns from the 
General_Ledger_Accounts table: account_number and account_description. 
The result set should have one row for each account number that has never 
been used. Use a subquery introduced with the NOT EXISTS operator, and sort 
the final result set by account_number. 
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Write a SELECT statement that returns four columns: vendor_name, invoice_id, 
invoice_sequence, and line_item_amt for each invoice that has more than one 
line item in the Invoice_Line_Items table. Hint: Use a subquery that tests for 
invoice_sequence > 1. 


Write a SELECT statement that returns a single value that represents the sum of 
the largest unpaid invoices for each vendor (just one for each vendor). Use an 
inline view that returns MAX(invoice_total) grouped by vendor_id, filtering for 
invoices with a balance due. 


Rewrite exercise 6 so it uses subquery factoring. 


Write a SELECT statement that returns the name, city, and state of each vendor 
that’s located in a unique city and state. In other words, don’t include vendors 
that have a city and state in common with another vendor. 


Use a correlated subquery to return one row per vendor, representing the 
vendor’s oldest invoice (the one with the earliest date). Each row should include 
these four columns: vendor_name, invoice_number, invoice_date, and 
invoice_total. 


Rewrite exercise 8 so it gets the same result but doesn’t use a correlated 
subquery. 
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How to insert, update, and 
delete data 


In the last four chapters, you learned how to code the SELECT statement to 
retrieve and summarize data. Now, you’ll learn how to code the INSERT, 
UPDATE, and DELETE statements to modify the data in a table. When you’re 
done with this chapter, you’ll know how to code the four statements that are 
used every day by professional application developers. 


How to create test tables ......cscssesesosesersocssereroncnereneosonererones 212 


How to create the tables for this book ..........csssssssesesessessessessneseeseesersrees 212 
Howitoicreatejaicopyio fatale nni E 212 
How to commit and rollback changes ........::ssssssssesesesessess 214 
Howto commit ehangos maanneen asinis aaaeaii iesakiet 214 
Howto rollback changes nna siaaaorata 214 
How to insert NEW FOWS. „.ssssunssunsunnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn nnna 216 
BAB yo Cen Tmt a BUN ae setreccestcengeipenilsdanseincsense Giver nctaraievedetaciebgntessaneld 216 
How to insert default values and null values ............:sscsscsssreccsceessessensenees 218 
How to use a subquery to insert multiple rows .......csscsecesesetseeeeeteesearsetsens 220 
How to update existing rows ESESESESESRERERESGESGESRGESREREREGEGEGEGEGBGBES 222 
Howto Update ew a ssis. sescsesusweneiparsveseviesesssaasisnevessisssnessastssvésvsscateassancsess 222 
How to use a subquery in an UPDATE statement ...........ccsssssersersersereseees 224 
How to delete existing rOWS .......ssssssscscsssssssssessssssssenereresss 226 
Howto delete TOWS te, wissscsnsssascaecsbadanacanincansenivknsareatalarseiieiuivcdisteRantaesaes 226 
How to use a subquery in a DELETE statement ..ssssesssssssssssssssessassssossasssse 226 
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How to create test tables 


As you practice coding INSERT, UPDATE, and DELETE statements, you 
need to make sure that your experimentation won’t affect “live” data that’s used 
by other people at your business or school. Two ways to get around that are 
presented next. 


How to create the tables for this book 


The first procedure in figure A-5 of appendix A shows how to create the 
tables that are used for the examples in this book. After you create these tables, 
you can modify them without worrying about how much you change them. If 
you ever want to restore the tables to their original data, you can use the second 
procedure in figure A-5. 


How to create a copy of a table 


If you want to test INSERT, UPDATE, and DELETE statements on tables 
that are running on a server that’s available from your business or school, you 
can create a copy of some or all of a table before you do any testing. To do that, 
you can use the CREATE TABLE statement with an embedded SELECT 
statement as shown in figure 7-1. Then, you can experiment all you want with 
the test tables and delete them when you’re done. When you use this technique, 
the result set that’s defined by the SELECT statement is simply copied into a 
new table. 

The three examples in this figure show some of the ways you can use this 
statement. Here, the first example copies all of the columns from all of the rows 
in the Invoices table into a new table named Invoices_Copy. The second ex- 
ample copies all of the columns in the Invoices table into a new table named 
Old_Invoices, but only for rows where the balance due is zero. And the third 
example creates a table that contains summary data from the Invoices table. 

When you’re done experimenting with test tables, you can use the DROP 
TABLE statement that’s shown in this figure to delete any tables you don’t need 
anymore. In this figure, for instance, the fourth example shows how to drop the 
Invoices_Copy table. 

When you use this technique to create tables, though, only the column 
definitions and data are copied, which means that definitions like those of 
primary keys, foreign keys, and default values aren’t retained. As a result, the 
results that you get when you test against copied tables may be slightly different 
than the results you would get with the original tables. You’ll understand that 
better after you read chapters 9 and 10. 
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The syntax of the CREATE TABLE AS statement 


CREATE TABLE table name AS 
SELECT select list 

FROM table source 

[WHERE search condition] 
[GROUP BY group by list] 
[HAVING search condition] 
[ORDER BY order by list] 


A statement that creates a complete copy of the Invoices table 


CREATE TABLE invoices copy AS 
SELECT * 
FROM invoices 


A statement that creates a partial copy of the Invoices table 


CREATE TABLE old invoices AS 

SELECT * 

FROM invoices 

WHERE invoice_total - payment_total - credit _total = 0 


A statement that creates a table with summary rows from the Invoices 
table 


CREATE TABLE vendor_balances AS 
SELECT vender _id, SUM(involce total) AS sum of_ invoices 
FROM invoices 


WHERE (invoice total - payment_total - credit_total) <> 0 
GROUP BY vendor_id 


A statement that deletes a table 
DROP TABLE invoices_copy 


Description 


e ‘You can create a new table based on the result set defined by the SELECT statement. 
Since the definitions of the columns in the new table are based on the columns in the 
result set, the column names assigned in the SELECT clause must be unique. 


e You can code the other clauses of the SELECT statement just as you would for any other 
SELECT statement, including grouping, aggregates, joins, and subqueries. 


e If you use calculated values in the select list, you must name the column since that name 
is used in the definition of the new table. 


e The table you name must not exist. If it does, you must delete the table by using the 
DROP TABLE statement before you execute the SELECT statement. 
Warning 


e When you use the SELECT statement to create a table, only the column definitions and 
data are copied. Definitions of primary keys, foreign keys, indexes, default values, and so 
on are not included in the new table. 


Figure 7-1 How to create a table from another table 
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How to commit and rollback changes 


When you use SQL Developer to execute INSERT, UPDATE, and DELETE 
statements, Oracle Database automatically adds those statements to a transac- 
tion. A transaction is a group of SQL statements that must all be executed 
successfully before they are saved to the database. To make the changes to the 
database permanent, you must explicitly commit the changes to the database. 
Otherwise, you can undo, or rollback, the changes. 

For example, the INSERT statement in figure 7-2 adds a single row to the 
Invoices table. If you use SQL Developer to execute this statement, the row will 
be added to the Invoices table. Then, you will be able to see this row if you 
execute a SELECT statement that selects this row from the Invoices table. 
However, any other users who may be using this database won’t be able to see 
this row until you commit this change to the database. 

When you exit SQL Developer, the Oracle Database will automatically 
rollback all INSERT, UPDATE, and DELETE statements that haven’t explicitly 
been committed. In other words, if you don’t explicitly commit changes, they 
will be lost when you exit SQL Developer. 

When you’re practicing with INSERT, UPDATE, and DELETE statements, 
you may want them rolled back. If you want to make permanent changes to a 
production system, though, you’ll need to make sure to commit the changes. For 
more information about working with transactions, see chapter 14. 


You can make the changes to the database permanent by executing the 
COMMIT statement shown in this figure. Or, if you’re using SQL Developer, 
you can commit the changes by clicking on the Commit button or pressing F11. 
When you do, SQL Developer will display a message that indicates whether the 
commit succeeded. 

Most of the time, you’ll want to use one of these techniques to manually 
commit your changes. However, if you want SQL Developer to automatically 
commit changes immediately after they are made, you can enable the 
“Autocommit in SQL Worksheet” feature as described in this figure. 


How to rollback changes 


You can rollback any INSERT, UPDATE, and DELETE statements that 
haven’t been committed yet by executing the ROLLBACK statement shown in 
this figure. Or, if you’re using SQL Developer, you can undo the changes by 
clicking on the Rollback button or pressing F12. When you do, SQL Developer 
will display a message that indicates whether the rollback succeeded. 
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An INSERT statement that adds a new row to the Invoices table 


INSERT INTO invoices 
VALUES (115, 97, "456789', '01-AUG-08', 8344.50, 0, 0, 1, '31-AUG-08', NULL) 


The response from the system 


1 rows inserted 


A COMMIT statement that commits the changes 
COMMIT 


The response from the system 
COMMIT succeeded 


A ROLLBACK statement that rolls back the changes 


ROLLBACK 


The response from the system 
ROLLBACK succeeded 


Description 


e A transaction is a group of SQL statements that must all be executed together. By 
default, Oracle adds INSERT, UPDATE, and DELETE statements to a transaction. 

e To commit the changes made by a transaction, you can issue the COMMIT statement. 
Or, if you’re using SQL Developer, you can click on the Commit button or press F11. 

e To rollback the changes made by a transaction, you can issue the ROLLBACK state- 
ment. Or, if you’re using SQL Developer, you can click on the Rollback button or press 
F12. 

e By default, if you don’t explicitly commit the changes made to the database by a transac- 
tion, the Oracle Database will rollback the changes when you exit SQL Developer. 

e If you want SQL Developer to automatically commit changes to the database immedi- 
ately after each INSERT, UPDATE, or DELETE statement is executed, use the 
Tools»Preferences command. Then, expand the Database node, click on the Worksheet 
Parameters node, and check the “Autocommit in SQL Worksheet” check box. 


Figure 7-2 How to rollback or commit changes 


215 


216 Section2 The essential SQL skills 


How to insert new rows 


To add new rows to a table, you use the INSERT statement. This statement 
lets you insert a single row with the values you specify or selected rows from 
another table. You’ll see how to use both forms of the INSERT statement in the 
topics that follow. In addition, you’ll learn how to work with default values and 
null values when you insert new rows. 


How to insert a single row 


Figure 7-3 shows how to code an INSERT statement to insert a single row. 
The two examples in this figure insert a row into the Invoices table. The data 
this new row contains is defined near the top of this figure. 

In the first example, you can see that you name the table in which the row 
will be inserted in the INSERT clause. Then, the VALUES clause lists the 
values to be used for each column. You should notice three things about this list. 
First, it includes a value for every column in the table. Second, the values are 
listed in the same sequence that the columns appear in the table. That way, 
Oracle knows which value to assign to which column. And third, a null value is 
assigned to the last column, payment_date, using the NULL keyword. You'll 
learn more about using this keyword in the next topic. 

The second INSERT statement in this figure includes a column list in the 
INSERT clause. Notice that this list doesn’t include the payment_date column 
since it allows a null value. In addition, the columns aren’t listed in the same 
sequence as the columns in the Invoices table. When you include a list of 
columns, you can code the columns in any sequence you like. Then, you just 
need to be sure that the values in the VALUES clause are coded in the same 
sequence. 

When you specify the values for the columns to be inserted, you must be 
sure that those values are compatible with the data types of the columns. For 
example, you must enclose literal values for dates and strings within single 
quotes. However, you don’t need to enclose literal values for numbers in single 
quotes. You’ll learn more about data types and how to work with them in the 
next chapter. For now, just realize that if any of the values aren’t compatible 
with the data types of the corresponding columns, an error will occur and the 
row won’t be inserted. 
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The syntax of the INSERT statement for inserting a single row 


INSERT INTO table name [(column list)] 
VALUES (expression_1 [, expression_2]...) 


The values for a new row to be added to the Invoices table 
Column 


invoice_id payment_total 


vendor_id credit_total 

invoice_number 456789 terms_id 

invoice_date 8/01/2008 invoice_due_date 8/31/2008 
invoice_total 8,344.50 payment_date null 


An INSERT statement that adds the new row without using a column list 


INSERT INTO invoices 
VALUES (115, 97, '456789', '01-AUG-08', 8344.50, 0, 0, 1, '31-AUG-08', NULL) 


(1 rows inserted) 


An INSERT statement that adds the new row using a column list 


INSERT INTO invoices 
(invoice_id, vendor_id, invoice number, invoice total, payment total, 
credit total, terms_id, invoice _date, invoice due date) 

VALUES 
(115, 97, '456789', 8344.50, 0, 0, 1, '01-AUG-08', '31-AUG-08') 


(1 rows inserted) 


Description 
e You use the INSERT statement to add a new row to a table. 


e Inthe INSERT clause, you specify the name of the table that you want to add a row to, 
along with an optional column list. The INTO keyword is required. 

e You specify the values to be inserted in the VALUES clause. The values you specify 
depend on whether you include a column list. 

e If you don’t include a column list, you must specify the column values in the same order 
as they appear in the table, and you must code a value for each column in the table. 

e If you include a column list, you must specify the column values in the same order as 
they appear in the column list. You can omit columns with default values and columns 
that accept null values. 


e To insert a null value into a column, you can use the NULL keyword. To insert a default 
value, you can use the DEFAULT keyword. See figure 7-4 for more information on using 
these keywords. 


Figure 7-3 How to insert a single row 
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How to insert default values and null values 


If a column allows null values, you’ll want to know how to insert a null 
value into that column. Similarly, if a column is defined with a default value, 
you'll want to know how to insert that value. The technique you use depends on 
whether the INSERT statement includes a column list, as shown by the ex- 
amples in figure 7-4. 

All of these INSERT statements use a table named Color_Sample. This 
table contains the three columns shown at the top of this figure. The first 
column, color_id, represents the internal ID for the column. The second column, 
color_number, is defined with a default value of 0. And the third column, 
color_name, is defined so that it allows null values. 

The first two statements illustrate how you assign a default value or a null 
value using a column list. To do that, you simply omit the column from the list. 
In the first statement, for example, the column list names only the color_number 
column, so the color_name column is assigned a null value. Similarly, the 
column list in the second statement names only the color_name column, so the 
color_number is assigned its default value. 

The next three statements show how you assign a default or null value to a 
column without including a column list. As you can see, you do that by using 
the DEFAULT and NULL keywords. For example, the third statement specifies 
a value for the color_name column, but uses the DEFAULT keyword for the 
color_number column. Because of that, Oracle will assign a value of zero to this 
column. The fourth statement assigns a value of 808 to the color_number 
column, and it uses the NULL keyword to assign a null value to the color_name 
column. Finally, the fifth statement uses both the DEFAULT and NULL key- 
words to assign a value of zero to the color_number column and a null value to 
the color_name column. 
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The definition of the Color_Sample table 
Column name Data Type Not Null Default Value 
color_id NUMBER 


color_number NUMBER 
color_name VARCHAR2 


Five INSERT statements for the Color_Sample table 


INSERT INTO color_sample (color_id, color_number) 
VALUES (1, 606) 


INSERT INTO color_sample (color_id, color_name) 
VALUES (2, 'Yellow') 


INSERT INTO color sample 
VALUES (3, DEFAULT, 'Orange') 


INSERT INTO color_sample 
VALUES (4, 808, NULL) 


INSERT INTO color sample 
VALUES (5, DEFAULT, NULL) 


The Color_Sample table after the rows are inserted 


COLOR_ID |P) CoLOR_NUMBER |] COLOR_NAME 
606 (null) 
D Yellow 


0 Orange 
808 (null) 
D (nulld 


Description 


e Ifa column is defined so it allows null values, you can use the NULL keyword in the list 
of values to insert a null value into that column. 


e Ifa column is defined with a default value, you can use the DEFAULT keyword in the 
list of values to insert the default value for that column. 


e If you include a column list, you can omit columns with default values and null values. 
Then, the default value or null value is assigned automatically. 


Figure 7-4 How to insert default values and null values 
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How to use a subquery to insert multiple rows 


Instead of using the VALUES clause of the INSERT statement to specify the 
values for a single row, you can use a subquery to select the rows you want to 
insert from another table. Figure 7-5 shows you how to do that. 

Both examples in this figure retrieve rows from the Invoices table and insert 
them into a table named Invoice_Archive. This table is defined with the same 
columns as the Invoices table. However, the payment_total and credit_total 
columns aren’t defined with default values. Because of that, you must include 
values for these columns. 

The first example in this figure shows how you can use a subquery in an 
INSERT statement without coding a column list. In this example, the SELECT 
clause of the subquery is coded with an asterisk so that all the columns in the 
Invoices table will be retrieved. Then, after the search condition in the WHERE 
clause is applied, all the rows in the result set are inserted into the 
Invoice_Archive table. 

The second example shows how you can use a column list in the INSERT 
clause when you use a subquery to retrieve rows. Just as when you use the 
VALUES clause, you can list the columns in any sequence. However, the 
columns must be listed in the same sequence in the SELECT clause of the 
subquery. In addition, you can omit columns that are defined with default values 
or that allow null values. 

Notice that the subqueries in these statements aren’t coded within parenthe- 
ses as a subquery in a SELECT statement is. That’s because they’re not coded 
within a clause of the INSERT statement. Instead, they’re coded in place of the 
VALUES clause. 

Before you execute INSERT statements like the ones shown in this figure, 
you'll want to be sure that the rows and columns retrieved by the subquery are 
the ones you want to insert. To do that, you can execute the SELECT statement 
by itself. Then, when you’re sure it retrieves the correct data, you can add the 
INSERT clause to insert the rows into another table. 
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The syntax of the INSERT statement for inserting rows selected from 
another table 


INSERT [INTO] table_name [(column_list)] 
SELECT column list 

FROM table source 

[WHERE search condition] 


An INSERT statement that inserts paid invoices in the Invoices table into 
the Invoice_Archive table 


INSERT INTO invoice_archive 

SELECT * 

FROM invoices 

WHERE invoice total - payment total - credit total = 0 


(74 rows inserted) 


The same INSERT statement with a column list 


INSERT INTO invoice archive 
(invoice_id, vendor_id, invoice_number, invoice_total, credit_total, 
payment total, terms_id, invoice date, invoice due date) 

SELECT 
invoice id, vendor_id, invoice number, invoice total, credit total, 
payment total, terms_id, invoice date, invoice_due date 

FROM invoices 

WHERE invoice total - payment_total - credit_total = 0 


(74 rows inserted) 


Description 


e To insert rows selected from one or more tables into another table, you can code a 
subquery in place of the VALUES clause. Then, the rows in the derived table that result 
from the subquery are inserted into the table. 


e If you don’t code a column list in the INSERT clause, the subquery must return values 
for all the columns in the table where the rows will be inserted, and the columns must be 
returned in the same order as they appear in that table. 

e If you include a column list in the INSERT clause, the subquery must return values for 
those columns in the same order as they appear in the column list. You can omit columns 
with default values and columns that accept null values. However, it is good program- 
ming practice to always include a column list. 


Figure 7-5 How to use a subquery to insert multiple rows 
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How to update existing rows 


To modify the data in one or more rows of a table, you use the UPDATE 
statement. Although most of the UPDATE statements you code will perform 
simple updates, you can also code more complex UPDATE statements that 
include subqueries. 


How to update rows 


Figure 7-6 presents the syntax of the UPDATE statement. As you can see in 
the examples, most UPDATE statements include just the UPDATE, SET, and 
WHERE clauses. The UPDATE clause names the table to be updated, the SET 
clause names the columns to be updated and the values to be assigned to those 
columns, and the WHERE clause specifies the condition a row must meet to be 
updated. Although the WHERE clause is optional, you’ll almost always include 
it. If you don’t, all of the rows in the table will be updated, which usually isn’t 
what you want. 

The first UPDATE statement in this figure modifies the values of two 
columns in the Invoices table: payment_date and payment_total. Because the 
WHERE clause in this statement identifies a specific invoice number, only the 
columns in that invoice will be updated. Notice in this example that the value to 
be assigned to payment_date is coded as a literal. You should realize, however, 
that you can assign any valid expression to a column as long as it evaluates to a 
value that’s compatible with the data type of the column. You can also use the 
NULL keyword to assign a null value to a column that allows nulls, and you can 
use the DEFAULT keyword to assign the default value to a column that’s 
defined with one. 

The second UPDATE statement modifies a single column in the Invoices 
table: terms_id. This time, however, the WHERE clause specifies that all the 
rows for vendor 95 should be updated. Because this vendor has six rows in the 
Invoices table, all six rows will be updated. 

The third UPDATE statement illustrates how you can use an expression to 
assign a value to a column. In this case, the expression increases the value of the 
credit_total column by 100. Like the first UPDATE statement, this statement 
updates a single row. 

Before you execute an UPDATE statement, you’ll want to be sure that 
you’ve selected the correct rows. To do that, you can execute a SELECT state- 
ment with the same search condition. Then, if the SELECT statement returns 
the correct rows, you can change it to an UPDATE statement. 
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The syntax of the UPDATE statement 


UPDATE table name 
SET column name_1 = expression_1 [, column_name 2 = expression_2]... 
[WHERE search condition] 


An UPDATE statement that assigns new values to two columns of a single 
row in the Invoices table 


UPDATE invoices 

SET payment_date = '21-SEP-08', 
payment total = 19351.18 

WHERE invoice number = '97/522! 


(1 rows updated) 


An UPDATE statement that assigns a new value to one column of all in- 
voices for a vendor 


UPDATE invoices 
SET terms id = 1 
WHERE vendor_id = 95 


{6 rows updated) 


An UPDATE statement that uses an arithmetic expression to assign a 
value to a column 


UPDATE invoices 
SET credit total = credit total + 100 
WHERE invoice number = '97/522' 


{1 rows updated) 


Description 


e ‘You use the UPDATE statement to modify one or more rows in the table named in the 
UPDATE clause. 


e ‘You name the columns to be modified and the value to be assigned to each column in the 
SET clause. You can specify the value for a column as a literal or an expression. 


e ‘You can specify the conditions that must be met for a row to be updated in the WHERE 
clause. 


e ‘You can use the DEFAULT keyword to assign the default value to a column that has one, 
and you can use the NULL keyword to assign a null value to a column that allows nulls. 
For more information, see figure 7-4. 


Warning 
e Ifyou omit the WHERE clause, all rows in the table will be updated. 


Figure 7-6 How to update rows 
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How to use a subquery in an UPDATE statement 


Figure 7-7 presents three UPDATE statements that illustrate how you can 
use subqueries in an update operation. In the first statement, a subquery is used 
in the SET clause to retrieve the maximum invoice due date from the Invoices 
table. Then, that value is assigned to the invoice_due_date column for invoice 
number 97/522. 

In the second statement, a subquery is used in the WHERE clause to 
identify the invoices to be updated. This subquery returns the vendor_id value 
for the vendor in the Vendors table with the name “Pacific Bell.” Then, all the 
invoices with that vendor_id value are updated. 

The third UPDATE statement also uses a subquery in the WHERE clause. 
This subquery returns a list of the vendor_id values for all vendors in California, 
Arizona, and Nevada. Then, the IN operator is used to update all the invoices 
with vendor_id values in that list. Note that although the subquery returns 80 
vendors, many of these vendors don’t have invoices. As a result, the UPDATE 
statement only affects 51 invoices. 
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An UPDATE statement that assigns the maximum due date in the Invoices 
table to a specific invoice 


UPDATE invoices 
SET credit total = credit total + 100, 

invoice due date (SELECT MAX(inveice due date) FROM invoices) 
WHERE invoice_number 197/522! 


(1 rows updated) 


An UPDATE statement that updates all invoices for a vendor based on the 
vendor’s name 
UPDATE invoices 
SET terms id = 1 
WHERE vendor_id = 
(SELECT vendor id 
FROM vendors 
WHERE vendor name = 'Pacific Bell') 


(6 rows updated) 


An UPDATE statement that changes the terms of all invoices for vendors 
in three states 


UPDATE invoices 
SET terms id = 1 
WHERE vendor _id IN 
(SELECT vendor_id 
FROM vendors 
WHERE vendeor_state IN ('CA', 'AZ', ‘NV')) 


{51 rows updated) 


Description 


e You can code a subquery in the SET or WHERE clause of an UPDATE statement. 
e You can use a subquery in the SET clause to return the value that’s assigned to a column. 


e ‘You can code a subquery in the WHERE clause to provide one or more values used in 
the search condition. 


Figure 7-7 How to use subqueries in an UPDATE statement 
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How to delete existing rows 


To delete one or more rows from a table, you use the DELETE statement. 
Just as you can with the UPDATE statement, you can use subqueries in a 
DELETE statement to help identify the rows to be deleted. 


How to delete rows 


Figure 7-8 presents the syntax of the DELETE statement along with three 
examples that illustrate some basic delete operations. As you can see, you 
specify the name of the table that contains the rows to be deleted in the DE- 
LETE clause. You can also code the FROM keyword in this clause, but this 
keyword is optional so it can be omitted. 

To identify the rows to be deleted, you code a search condition in the 
WHERE clause. Although this clause is optional, you’ll almost always include 
it. If you don’t, all of the rows in the table are deleted. This is a common coding 
mistake, and it can be disastrous if you’re working with live data. 

If you want to make sure that you’ve selected the correct rows before you 
issue the DELETE statement, you can issue a SELECT statement with the same 
search condition. Then, if the correct rows are retrieved, you can use the same 
search for the DELETE statement. 

The first DELETE statement in this figure deletes a single row from the 
Invoice_Line_Items table. To do that, it specifies the invoice_id value of the row 
to be deleted and the invoice_sequence value in the WHERE clause. 

The second DELETE statement deletes four rows from the Invoice_Line_Items 
table. To do that, it specifies 100 as the invoice_id value of the row to be deleted 
in the WHERE clause. Since the invoice for this ID has four line items, this 
deletes all four line items for the invoice. 

If you try to delete a row that has one or more child rows that are defined 
with a foreign-key constraint, Oracle will return an error message and won’t 
delete the row. For example, if you attempt to delete a row from the Vendors 
table that has child rows in the Invoices and Invoice_Line_Items tables, Oracle 
will return an error message that indicates that an integrity constraint was 
violated, and it won’t delete the vendor. Usually, that’s what you want. 


How to use a subquery in a DELETE statement 


If you really want to delete a row that has one or more child rows from the 
Vendors table, you can start by deleting all of the invoices and line items for that 
vendor. To do that, you can use a statement like the third one in this figure to 
delete the line items for the vendor from the Invoice_Line_Items table. Note 
how this statement uses a subquery to delete all line items for the vendor with 
the ID of 115. Then, you can use a similar statement to delete all of the invoices 
for that vendor from the Invoices table. Finally, you can use a simple DELETE 
statement to delete the row for the vendor from the Vendors table. 
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The syntax of the DELETE statement 


DELETE [FROM] table name 
[WHERE search condition] 


A DELETE statement that deletes one row 


DELETE FROM invoice line items 
WHERE invoice id = 100 AND invoice sequence = 1 


(1 rows deleted) 


A DELETE statement that deletes four rows 


DELETE FROM invoice line items 
WHERE invoice_id = 100 


(4 rows deleted) 


A DELETE statement that uses a subquery to delete all invoice line items 
for a vendor 


DELETE FROM invoice line items 
WHERE invoice _id IN 

(SELECT invoice_id 

FROM invoices 

WHERE vendor_id = 115) 


(4 rows deleted) 


Description 


e ‘You can use the DELETE statement to delete one or more rows from the table you name 
in the DELETE clause. 


e You specify the conditions that must be met for a row to be deleted in the WHERE 
clause. 


e You can use a subquery within the WHERE clause. 


e A foreign-key constraint may prevent you from deleting a row. In that case, you can only 
delete the row if you delete all child rows for that row first. 


Warning 


e Ifyou omit the WHERE clause from a DELETE statement, all the rows in the table will 
be deleted. 


Figure 7-8 How to delete rows 
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Perspective 


In this chapter, you learned how to use the INSERT, UPDATE, and DELETE 
statements to modify the data in a database. In chapters 9 and 10, you’ll learn more 
about the table definitions that can affect the way these statements work. And in 
chapter 14, you’ll learn more about executing groups of INSERT, UPDATE, and 
DELETE statements as a single transaction. 


Terms 


transaction 
rollback 
commit 


Exercises 


To test whether a table has been modified correctly as you do these exercises, 
you can write and run an appropriate SELECT statement. Or, when you’re using 
Oracle SQL Developer, you can click on a table name in the Connections 
window and then on the Data tab to display the data for all of the columns in the 
table. To refresh the data on this tab, click the Refresh button. 


1. Write an INSERT statement that adds this row to the Invoices table: 


invoice_id The next id in sequence (find out what this should be) 
vendor_id: 32 

invoice_number: AX-014-027 

invoice_date: 8/1/2008 

invoice_total: $434.58 

payment_total: $0.00 

credit_total: $0.00 

terms_id: 2 

invoice_due_date: 8/31/2008 

payment_date: null 


2. Write an UPDATE statement that modifies the Vendors table. Change the 
default account number to 403 for each vendor that has a default account 
number of 400. 

3. Write an UPDATE statement that modifies the Invoices table. Change the 
terms_id to 2 for each invoice that’s for a vendor with a default_terms_id of 2. 

4. Write a DELETE statement that deletes the row that you added to the Invoices 
table in exercise 1. 

5. After you have verified that all of the modifications for the first four exercises 


have been successful, rollback the changes. Then, verify that they have been 
rolled back. 
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How to work with data 
types and functions 


In chapter 3, you were introduced to some of the built-in scalar functions such 
as the SUBSTR, TO_CHAR, and SYSDATE functions. Now, this chapter 
expands on that coverage by presenting more of the built-in scalar functions. 
Because most of these functions work with specific data types, this chapter 
begins by describing the data types for an Oracle database. 
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The built-in data types 


A column’s data type specifies the type of information that the column is 
intended to store. A column’s data type also determines the types of operations 
that can be performed on the data. 


Data type overview 


The Oracle data types can be divided into the five categories shown in the 
first table in figure 8-1. In this chapter, you'll learn how to work with the most 
important data types in the first three categories. Then, you can learn more 
about working with the data types in the temporal category in chapter 17, and 
you can learn how to work with the data types in the large object category in 
chapter 18. 

The character data types are used to store a string of one or more characters 
that can include letters, numbers, or special characters like the pound sign (#) or 
the at sign (@). The terms character, string, and text are used interchangeably 
to describe this type of data. 

The numeric data types are used to store numbers that can be used for 
mathematical calculations. These numbers can be integers, which are numbers 
that don’t contain decimal places, or they can be numbers that contain decimal 
places. They can also be floating-point numbers that are used to store approxi- 
mate values for very large and very small numbers. 

The temporal data types are used primarily to store dates and times. In 
Oracle terminology, these data types are referred to as the datetime, date/time, 
or just date types. Remember, though that these types always include a time 
component as well as a date component. Besides dates and times, the temporal 
data types include time intervals and timestamps, which you can learn more 
about in chapter 17. 

The large object (LOB) data types are used to store large amounts of text, 
images, sound, video, and so on. In the old days (you know, way back in the 80s 
and 90s), databases were primarily used to store character, numeric, and date/ 
time data. Today, however, you can use the large object types to store other 
types of data. That’s why we show how to use these data types in chapter 18. 

The rowid data types are used to store an address for each row in a data- 
base. If you want, you can view the address for a row by using the ROWID 
pseudocolumn. However, since most developers don’t need to view these 
addresses, these data types aren’t presented in this book. For more information 
about the rowid data types, look up “Rowid Datatypes” in the Oracle Database 
SQL Reference manual. 

Most of the Oracle data types correspond to the ANSI-standard data types. 
These data types are listed in the second table in this figure. Here, the second 
column lists the Oracle data type names, and the first column lists the synonyms 
Oracle provides for the ANSI-standard data types. Although you can use these 
synonyms instead of the Oracle data types, there’s no reason to do that. If you 
do, Oracle simply maps the synonyms to the corresponding Oracle data types. 
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Built-in data type categories 


Character Strings of characters 
Numeric Integer, decimal, and floating-point numbers 


Temporal Dates, times, time intervals, and timestamps 8 and 17 
Large object (LOB) Text, images, sound, and video 18 
Rowid Addresses for each row in the database (not covered) 


ANSI data types and Oracle equivalents 

ANSI synonyms = =< — — —Oracdedatatype —ć— č 
CHARACTER (n) CHAR (n) 

CHAR (n) 

CHARACTER VARYING (n) VARCHAR2 (n) 

CHAR VARYING (n) 

NATIONAL CHARACTER (n) NCHAR (n) 

NATIONAL CHAR (n) 

NCHAR (n) 

NATIONAL CHARACTER VARYING (n) NVARCHAR2 (n) 


NATIONAL CHAR VARYING (n) 
NCHAR VARYING (n) 


NUMERIC (p,8) NUMBER (p,8) 
DECIMAL (p,8) 
INTEGER NUMBER (38) 
INT 
SMALLINT 
FLOAT FLOAT (126) 
DOUBLE PRECISION FLOAT (126) 
REAL FLOAT (63) 
Description 
e Oracle provides built-in data types that can be divided into the five categories shown 
above. 


e Oracle provides a synonym for each of the ANSI-standard data types. These are the SQL 
data types that are specified by the American National Standards Institute. 


e When you use the synonym for an ANSI data type, it’s mapped to the appropriate Oracle 
data type indicated in the table above. 


Figure 8-1 Data type overview 
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The character data types 


Figure 8-2 presents the four character data types. The CHAR and 
VARCHAR2 types store strings of characters defined by the ASCII character 
set. This character set uses one byte per character but only defines 256 charac- 
ters, which is usually adequate for working with the English language. 

The NCHAR and NVARCHAR? data types store strings of characters 
defined by the Unicode character set. These types of characters require two or 
three bytes per character, depending on the type of encoding that’s used. How- 
ever, this character set defines over 65,000 characters including most characters 
from most of the world’s languages. Since the Unicode characters are com- 
monly referred to as national characters, these data types begin with the prefix n. 

You use the CHAR and NCHAR data types to store fixed-length strings. 
Data stored using these data types always occupies the same number of bytes, 
regardless of the actual length of the string. These data types are typically used 
to define columns that have a fixed number of characters. For example, the 
vendor_state column in the Vendors table is defined as CHAR(2) because it 
always contains two characters. 

You use the VARCHAR2 and NVARCHAR? data types to store variable- 
length strings. Data stored using these data types occupies only the number of 
bytes needed to store the string. These types are typically used to define col- 
umns whose lengths vary from one row to the next. In general, variable-length 
Strings are more efficient than fixed-length strings because the database only 
uses the amount of disk space required by the value. For example, if you define 
a column as VARCHAR2(50) and it stores a value that’s three characters long, 
the database only stores the three characters. In contrast, if you define the 
column as CHAR(50) and it stores a value that’s three characters long, the 
database stores 50 characters, padding the value with 47 space characters. 

Although you typically store numeric values using numeric data types, the 
character data types may be a better choice for some numeric values. For 
example, you typically store zip codes, telephone numbers, and social security 
numbers in character columns even though they contain only numbers. That’s 
because their values aren’t used in arithmetic operations. In addition, if you 
store these numbers in numeric columns, leading zeroes are stripped, which 
usually isn’t what you want. 
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The character data types used to store standard characters 
Description 


CHAR[(size [BYTE|CHAR] )] Fixed-length strings of character data where size is the number of 
characters or bytes between 1 and 2000. The default is 1 character. 

VARCHAR2 (size [BYTE|CHAR] ) Variable-length strings of character data where size is the maxi- 
mum number of characters or bytes between 1 and 4000. The size 
argument is required. 


The character data types used to store Unicode characters 
Description 


NCHAR [ (size) ] Fixed-length Unicode characters where size is the number of 
characters. The default and minimum size is 1. Maximum size is 
determined by the national character set definition, with an upper 
limit of 2000 bytes. 


NVARCHAR2 (size) Variable-length Unicode characters where size is the maximum 
number of characters. The size argument is required. Maximum 
size is determined by the national character set definition, with an 
upper limit of 4000 bytes. 


Description 

e The ASCH character set provides for 256 characters with 1 byte per character. 

e The CHAR and VARCHAR? types use the ASCH character set. 

e The Unicode character set provides for over 65,000 characters, usually with 2 bytes per 


character. In the Unicode character set, the first 256 characters correspond with the 256 
ASCII characters. 


e The NCHAR and NVARCHAR types use the Unicode character set, which is commonly 
referred to as the national character set. 


e The CHAR and NCHAR data types are typically used for fixed-length strings. These 
data types use the same amount of storage regardless of the actual length of the string. If 
you insert a value that’s shorter than the specified type, the end of the value will be 
padded with spaces. 

e The VARCHAR and NVARCHAR data types are typically used for variable-length 
strings. These data types use only the amount of storage needed for a given string. 


Figure 8-2 The character data types 
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The numeric data types 


Figure 8-3 presents the numeric data types supported by Oracle. The 
NUMBER data type is used to store positive and negative numbers with a fixed 
number of digits. When you specify this data type, you give the precision and 
scale in parentheses. The precision is the total number of digits that can be 
stored in the column, and the scale is the number of digits that can be stored to 
the right of the decimal point. For business applications, this will probably be 
the only numeric data type that you will need. 

For columns that contain integers, you code the NUMBER data type with 
just the precision. For example, a column defined as NUMBER(S) allows for an 
integer value of up to 99999. In contrast, you code both the precision and scale 
for numbers that contain decimal positions. For example, a column defined as 
NUMBER(7, 2) allows for a number seven digits long with two digits to the 
right of the decimal point. As a result, the maximum value for the column is 
99999.99. 

In contrast to the fixed numbers stored by the NUMBER data type, the 
FLOAT data type is used to store floating-point numbers. These numbers are 
useful for storing the values for very large or small numbers, but with a limited 
number of significant digits. As a result, the FLOAT data type doesn’t always 
provide for exact values. 

To express the value of a floating-point number, you can use scientific 
notation. To use this notation, you type the letter E followed by a power of 10. 
For instance, 3.65E+9 is equal to 3.65 x 109, or 3,650,000,000. If you have a 
scientific or mathematical background, of course, you’re already familiar with 
this notation. And if you aren’t already familiar with this notation, you probably 
aren’t going to need to use floating-point numbers. 

The BINARY_FLOAT and BINARY_DOUBLE data types were introduced 
with Oracle Database 10g to conform to the IEEE (Institute of Electrical and 
Electronic Engineers) standard for floating-point arithmetic. BINARY_FLOAT 
is used to represent single-precision, 32-bit, floating-point numbers, while 
BINARY_DOUBLE is used to represent double-precision, 64-bit, floating-point 
numbers. 
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The numeric data types 


NUMBER [(p[, s])] 1 to 22 Stores zero as well as positive and negative fixed numbers with 


absolute values from 1.0x10 to, but not including, 1.0 x 10 . 
The precision (p) can range from 1 to 38. The scale (s) can range 
from -84 to 127. 


FLOAT [(p)] 1 to 22 Stores floating-point numbers. The precision (p) can range from 1 
to 126 bits. A FLOAT value is represented internally as a NUMBER 
value. 

BINARY_FLOAT 5 Stores single-precision, 32-bit, floating-point values. 

BINARY _DOUBLE 9 Stores double-precision, 64-bit, floating-point. 


Description 


The numeric data types are used to store numbers. 

The precision of a NUMBER type indicates the total number of digits that can be stored 
in the data type. The scale indicates the number of decimal digits that can be stored to 
the right of the decimal point. 

For integers, you use the NUMBER type with just the precision in parentheses. For 
decimal numbers, you use the NUMBER type with both the precision and scale in 
parentheses. 

A floating-point number provides for very large and very small numbers that require 
decimal positions, but with a limited number of significant digits. 

A single-precision floating-point number provides for up to 7 significant digits. A 
double-precision floating-point number provides for up to 16 significant digits. 


The BINARY_FLOAT and BINARY_DOUBLE types were introduced with Oracle 10g. 


Figure 8-3 The numeric data types 
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The temporal data types 


Figure 8-4 presents the temporal data types that are used to store dates and 
times. Of these data types, the DATE type is the oldest and the most commonly 
used. This data type stores the century, year, month, day, hour, minute and 
second. 

When you use a SELECT statement to select a column that’s defined with 
the DATE type, the date is displayed with the default format that’s used by the 
database. On most Oracle systems, that means that August 19, 2008 is displayed 
as “19-AUG-08”. Later in this chapter, though, you’ll learn how to use functions 
to display date values in other formats such as “2008-08-19” or “8/19/08”. 

When you store a DATE value, the value includes a time component, but 
this component isn’t displayed by default. Later in this chapter, you’ll learn how 
to use functions to work with the time component of a DATE value. For ex- 
ample, you'll learn how to display the time component in a format like 04:20:36 
PM or 16:20:36. 

When Oracle Database 9i was released, it provided several new data types 
for working with temporal data. To start, it introduced the three TIMESTAMP 
types shown in this figure. These data types provide two advantages over the 
DATE type. First, you can use them to store fractional seconds. Second, you can 
use them to store a time zone. 

Oracle Database 9i also introduced the INTERVAL types shown in this 
figure. These data types make it easier to work with time intervals like 2 days, 2 
hours, and 12 minutes. Before these data types were introduced, you usually 
used the NUMBER type to store intervals of time. 

Since the DATE type is the most widely used temporal type, you’ll learn 
how to work with it in this chapter. For some applications, this is the only 
temporal type that you will need. However, if you want to learn how to use the 
TIMESTAMP and INTERVAL types, you can refer to chapter 17. 


The date/time data types 
Bytes 


TIMESTAMP [ (fsp) ] 


TIMESTAMP [(£sp)] 13 
WITH TIME ZONE 


TIMESTAMP [(fsp)] 7 to 11 
WITH LOCAL TIME ZONE 


INTERVAL YEAR [(yp)] 
TO MONTH 


INTERVAL DAY [(dp)] 
TO SECOND [(f£sp)] 


Description 
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Descriptio 


Stores date and time information using fixed-length fields to 
store the century, year, month, day, minute, hour, and second. 
Valid dates range from January 1, 4712 BC, to December 31, 
9999 AD. This type does not have fractional seconds or a 
time zone. 


An extension of DATE. Stores year, month, day, hour, 
minute, second, and (optionally) the fractional part of a 
second where the fractional second precision (fsp) is the 
number of decimal places used to store the fractional part of 
a second. The fsp can range from 0 to 9, and the default is 6. 
This type does not have a time zone. 


Works like TIMESTAMP except that it includes 
either a time zone region name or a time zone offset. 


Works like TIMESTAMP WITH TIME ZONE 

except that the time zone is set to the database time zone 
when it is stored in the database, and the user sees the 
session time zone when data is retrieved. 


Stores a time interval in years and months. Year 

precision (yp) is the number of digits in the year, and the 
default is 2. 

Stores a time interval in days, hours, minutes and 

seconds. Day precision (dp) is the maximum number of 
digits in the day, and the default is 2. The fsp (fractional 
second precision) can range from 0 to 9, and the default is 6. 


e The temporal data types are typically referred to as the date/time, datetime, or simply 


date types. 


e The default date format is determined by the NLS_DATE_FORMAT and 
NLS_TERRITORY parameters. For more information about these parameters, you can 
refer to the Oracle Database Globalization Support Guide. 

e Inthis chapter, you’ll learn how to work with the most common temporal data type, the 
DATE type. To learn how to work with the other temporal data types, see chapter 17. 


Figure 8-4 The temporal data types 
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The large object data types 


Figure 8-5 presents the new large object (LOB) data types that were intro- 
duced with Oracle Database 8. These data types make it easier to store large 
amounts of character or binary data. For example, they can be used to store text, 
XML, Word, PDF, image, sound, and video files. 

The CLOB (Character Large Object) and NCLOB (National Character 
Large Object) types can store character data. These data types are commonly 
used to store large text and XML files. The primary difference between these 
types is that the CLOB type uses 1 byte per character to store characters in the 
ASCII character set while the NCLOB type uses 2 or 3 bytes per character to 
store characters in the Unicode character set. 

The BLOB (Binary Large Object) type can store any kind of data in binary 
format. It can be used to store binary files such as PDF files, and it can be used 
to store image, sound, and video files. 

The BFILE (Binary File) type stores a pointer to a binary file that’s stored 
outside of the database. These binary files can be stored anywhere that’s acces- 
sible through the host computer’s file system. 

This figure finishes by presenting the old data types for storing large objects 
that were used prior to Oracle Database 8. These data types are provided 
primarily for backward compatibility. As a result, you should use one of the new 
LOB data types for any new development. 
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The large object data types 
Description 
Character large object. Stores up to 8 terabytes of character data inside the database. 
National character large object. Stores up to 8 terabytes of national character data inside the 


database. 
Binary large object. Stores up to 8 terabytes of unstructured binary data inside the database. 


Binary file. Stores a pointer to a large binary file stored outside the database in the file 
system of the host computer. 


The old data types for large objects 
Description 


RAW (size) Stores up to 2000 bytes of binary data that is not intended to be converted by Oracle when 
moving data between different systems. 


LONG Stores up to 2 gigabytes of character data. 
LONG RAW Stores up to 2 gigabytes of row binary data that is not intended to be converted, 


Description 


e The large object (LOB) data types can store large and unstructured data such as text, 
images, sound, and video. 


e For more information about large object data types, see chapter 18. 


Figure 8-5 The large object data types 
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How to convert data from one type to 
another 


As you work with the various data types, you’ll find that you frequently 
need to convert a value from one data type to another. To do that, you can use 
the functions that are described next. 


How to convert characters, numbers, and dates 


Figure 8-6 shows how to use three of the Oracle functions to convert data 
from one type to another. The TO_CHAR function converts an expression to the 
VARCHAR? data type. The TO_LNUMBER function converts an expression to 
the NUMBER type. And the TO_DATE function converts an expression to the 
DATE type. 

As the syntax summaries and examples for these functions show, you can 
code these functions with or without format specifications. When you code a 
format specification, it determines how the data is converted. Otherwise, the 
default format for the system is used. In the next two figures, you’ll learn how 
to code the format specifications for converting numbers and dates. 

To show how these functions work, these examples use literal values to 
specify numbers and dates. As a result, date literals are enclosed in single 
quotation marks, and numeric literals aren’t enclosed. In practice, though, 
you’re more likely to use these functions on columns or expressions. 

In the first two examples, you can see how the TO_CHAR function can be 
used to convert a numeric value to character data. In the next two examples, you 
can see how the system date is converted to character data with and without a 
format specification, In the third example, only the date is shown. In the fourth 
example, both the date and time are shown with a 24-hour clock. In both cases, 
the system date includes the date and time, but the default date format doesn’t 
show the time. 

In the last four examples, you can see how the TO_NUMBER and 
TO_DATE functions work. In the seventh example, the TO_DATE function 
converts the characters “15-APR-08” to a DATE data type with a time value of 
00:00:00. In the eighth example, this DATE data type is converted back to 
characters with a format that shows both the date and time with a 24-hour clock. 

Besides the three TO functions in this figure, Oracle provides TO functions 
for all of its built-in types. For example, you can use the TO_LNCHAR function 
to convert a number or date/time value to an NVARCHAR2 type, and you can 
use the TO_TIMESTAMP function to convert a character type to a 
TIMESTAMP type. For more information about TO functions, navigate to the 
“Functions” section of the Oracle Database SQL Reference manual and scroll 
down to the functions that begin with TO. 

One other way to convert data is to use the CAST function that’s shown in 
this figure. This is an ANSI-standard function that lets you convert, or cast, an 
expression to the data type you specify. In the example, the SELECT statement 
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Oracle functions for converting data 
Function Description 


TO_CHAR(expr[, format] ) Converts the result of an expression to a value of the VARCHAR2 
type. 

TO _NUMBER(expr[, format] ) Converts the result of an expression to a value of NUMBER type. 

TO DATE(expr[, format] ) Converts the result of an expression to a value of DATE type. 


Examples that use the Oracle TO functions 


TO_CHAR(1975.5) 1975.5 
TO_CHAR(1975.5, '§$99,999.99') $1,975.50 


TO CHAR (SYSDATE) 15-APR-08 
TO CHAR(SYSDATE, 'DD-MON-YYYY HH24:MI:SS') 15-APR-2008 10:53:56 


TO_NUMBER('1975.5') 1975.5 
TO_NUMBER('$1,975.5', '$99,999.99') 1975.5 


TO_DATE('15-APR-08') 15-APR-2008 00:00:00 
TO _CHAR(TO DATE('15-APR-08'), 'DD-MON-YYYY HH24:MI:S8') 15-APR-2008 00:00:00 


A statement that uses ANSI-standard CAST functions 


SELECT invoice id, invoice date, invoice total, 
CAST (invoice date AS VARCHAR2(9)) AS varchar date, 
CAST (invoice total AS NUMBER(9)) AS integer total 
FROM invoices 


Description 
e Ifa format element is specified, the TO functions use that format element to format the 
value that’s returned. Otherwise, these functions use the default formats for the system. 


e Although this figure only lists three of the TO functions, Oracle provides TO functions 
for most of the built-in data types. 

e CAST is an ANSI-standard function that you can use for simple conversions, but the TO 
functions give you more control over the conversions that are done. 

e For more information on the format elements for TO functions, please see the next two 
figures. 

e For more information about TO functions, navigate to the “Functions” section of the 


Oracle Database SQL Reference manual and scroll down to the functions that begin with 
TO. 


Figure 8-6 How to convert characters, numbers, and dates 
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casts the invoice_date column to the VARCHAR2(9) data type, and it casts the 
invoice_total column to the NUMBER(9) type. Although this can be useful, 
you’re usually better off using the TO functions because they give you more 
control over the conversions. 


Common number format elements 


Figures 8-7 summarizes the most common number format elements and 
provides some examples. These examples show how to specify the currency 
symbols, group separators, and signs that are used to format numbers. Here, if 
you specify a format that’s too short to accommodate the number, the database 
will return one or more # characters. 

Since these format elements often provide several ways to get the same 
result, you can use the format element that makes the most sense for your 
situation. For example, if you always want the currency symbol to be a dollar 
sign ($), you can use this sign to specify the currency symbol. However, if you 
want to use the default currency symbol for the database, you can use one of the 
other elements to specify the currency symbol. Then, the symbol that’s used will 
vary depending on the NLS parameters that are stored within the database. 

When coding some format elements, you can code the element before or 
after the format for the number. For example, you can code the MI element 
before or after the format for the number. In other cases, you must code the 
format element before the format for the number. For example, you must code 
the FM element before the format for the number. 


D 
G 
0 

$ 
L 
U 
C 
S 
MI 
PR 
FM 


1975.5 
1975.5 
1975.5 
1975.5 
1975.5 
1975.5 
1975.5 
1975.5 
1975.5 
1975.5 
1975.5 
1975.5 
-1975.5 
-1975.5 
1975.5 
-1975.5 
1975.5 
01975.50 
1975.5 


Description 


Number format elements 


Digits 

Decimal point 
Decimal point 
Comma 

Group separator 


Leading or trailing zeros 


Dollar sign 


Local currency symbol 


Dual currency symbol 
Currency symbol 
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Minus sign for negative numbers, plus sign for positive numbers 

Minus sign for negative numbers 

Negative numbers in brackets, one space before and after positive numbers 
Removes leading or trailing spaces or zeros 


Scientific notation 


(none specified) 
999 

9999 
9,999.9 
9G999D9 
99,999.99 
09,999.990 
$99,999.99 
L9,999.99 
u9,999.99 
c9,999.99 
89,999.99 
9,999.998 
9,999.99MI 
9,999.99MI 
9,999.99PR 
9,999.99PR 
FM9,999.99 
9.99EEEE 


1975.5 
Hitt 
1976 
1,975.5 
1,975.5 
1,975.50 
01,975.500 
$1,975.50 
$1,975.50 
$1,975.50 
USD1,975.50 
+1,975.50 
1,975.50- 
1,975.50- 
1,975.50 
<1,975.50> 
1,975.50 
1,975.5 
1.98E+03 


e For more information, look up “Number Format Elements” in the Oracle Database SQL 
Reference manual. 


Figure 8-7 


Common number format elements 
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Common date/time format elements 


Figures 8-8 shows how to use the most common date/time format elements. 
After you read through the list of format elements, you shouldn’t have much 
trouble understanding how the examples work. 

However, the RR format element requires some explanation. This element 
allows you to use a two-digit year to specify years in two different centuries. 
For this century, for example, years from 00 to 49 are interpreted as 2000 to 
2049, and years 50 through 99 are interpreted as 1950 through 1999. As a result, 
98 is interpreted as 1998, not 2098. In most cases, that’s what you want. If that 
isn’t what you want, you can use the YY format to specify the year. 

In addition, note that the FF format element works with the fractional 
seconds component, which isn’t available from the DATE type. As a result, you 
will only use this element when you’re working with the TIMESTAMP and 
INTERVAL types that are described in chapter 17. 

If you study the summary and examples in this figure, you will see that you 
can include periods when using the BC, AD, AM, and PM elements. If, for 
example, you specify “B.C.” as a format element, the formatted date will 
include a value of “B.C” or “A.D.”, depending on whether the date is before or 
after the birth of Christ. 

You will also see that the MONTH and DAY elements pad the end of the 
value that’s returned with spaces to accommodate the longest value that can be 
returned. Since this makes it easy to align the month and day elements of a date 
in columns, this is often what you want. If it isn’t what you want, you can use 
the TRIM function described later in this chapter to trim the spaces from the 
ends of these elements. 

Last, if a format element returns letters, the capitalization that you use when 
you specify the format element corresponds with the capitalization for the 
returned value. If, for example, you specify “MONTH”, the month will be 
returned in all caps (AUGUST). If you specify “Month”, the month will be 
returned with an initial cap (August). And if you specify “month”, the month 
will be returned in lowercase (august). Note that this also applies to the date 
format elements for both the full and abbreviated names of months and days. 
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Common date/time format elements 


Anno Domini 

Before Christ 

Century 

Year spelled out 

Four-digit year 

Two-digit year 

Two-digit round year 
Quarter of year (1-4) 

Name of month padded with spaces 
Abbreviated name of month 
Month (01-12) 

Week of year (1-52) 

Week of month (1-5) 


(none specified) 
DD-MON-YY 
DD-MON-RR 
DD-Mon-YY 

MM/DD/YY 
YYYY-MM-DD 

Dy Mon DD, YY 
MONTH DD, YYYY BC 
Month DD, YYYY B.C. 


HH: MI 
HH24:MI:8S 
HH:MI AM 
HH:MI A.M. 
HH:MI:88 
HH:MI:SS.FF5 
HH:MI:838.FF4 


YYYY-MM-DD HH:MI:883 AM 


Description 
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Name of day padded with spaces 
Abbreviated name of day 

Day of year (1-366) 

Day of month (01-31) 

Day of week (1-7) 

Hour of day (01-12) 

Hour of day (01-24) 

Minute (00-59) 

Second (00-59) 

Seconds past midnight (0-86399) 
Fractional seconds 

Post Meridian 

Ante Meridian 


19-AUG-08 
19-AUG-08 
19-AUG-08 
19-Aug-08 
08/19/08 
2008-08-19 

Tue Aug 19, 08 


AUGUST 
August 


04:20 


16:20: 


04:20 
04:20 


04:20: 
04:20: 
04:20; 


19, 2008 AD 
19, 2008 A.D. 


36 

PM 

P.M. 

36 
36.12345 
36.1234 


2008-08-19 04:20:36 PM 


e Ifyou use the RR format, years from 00 to 49 are interpreted as 2000 to 2049, and years 


50 through 99 are interpreted as 1950 through 1999. 


e For more information about date/time format elements, look up “Datetime Format 
Elements” in the Oracle Database SQL Reference manual. 


Figure 8-8 


Common date/time format elements 
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How to convert characters to and from their 
numeric codes 


Figure 8-9 shows three functions that are used to convert characters to and 
from their equivalent numeric code. The CHR and ASCII functions work with 
standard ASCII characters. 

For instance, the CHR function in this figure converts the number 97 to its 
equivalent ASCII code, the letter a. Conversely, the ASCII function converts the 
letter a to its numeric equivalent of 97. Although the string in the ASCII function 
can include more than one character, please note that only the first character is 
converted. 

The NCHR function works like the CHR function. However, since it usually 
works with the Unicode character set, it’s able to convert thousands of characters. 

The CHR function is frequently used to output ASCII control characters that 
can’t be typed on your keyboard. The three most common control characters are 
presented in this figure. These characters can be used to format output so it’s 
easier to read, The SELECT statement in this figure, for example, uses the 
CHR(13) control character to start a new line after the vendor name and vendor 
address in the output. 
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Three functions for converting characters to and from their numeric codes 
Function Description 


ASCII (string) Returns a NUMBER value that corresponds to the first character in the specified 
string of the CHAR, VARCHAR2, NCHAR, or NVARCHAR2 type. 


CHR (number) Returns a VARCHAR2 value that corresponds to the specified NUMBER value in 
the ASCII character set. 


NCHR (number) Returns a VARCHAR? value that corresponds to the specified NUMBER value in 
the national character set. 


Examples that use the ASCII, CHR, and NCHR functions 
Example Result 

ASCII ('a') 

ASCII ('abc') 

CHR (97) 

NCHR (97) 

NCHR (332) 


ASCII codes for common control characters 
Control character Value 


Tab CHR(9) 


Line feed CHR(10) 
Carriage return CHR(13) 


A SELECT statement that uses the CHR function to format output 


SELECT vendor name || CHR(13) 
|| vendor_address1 || CHR(13) 
|| vendor _city || ', ' || vendor_state || ' ' || vendor_zip code 
AS vendor_address 

FROM vendors 

WHERE vendor id = 1 


The value that’s returned 


VENDOR_ADDRESS 


US Postal Service 
Attn: Supt. Window Services 
Madison, WI 53707 


Figure 8-9 How to convert characters to and from their numeric codes 
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How to work with character data 


Now that you know how to convert the number and date/time data types to 
and from the character data types, you’re read to learn some additional functions 
for working with character data. 


How to use the common character functions 


Figure 8-10 shows how to use some of the common character functions 
available from Oracle. To start, you can use the LTRIM and RTRIM functions to 
remove leading or trailing characters from the left or right side of a string. Or, 
you can use the TRIM function to remove leading and trailing characters from 
both sides of a string. Most of the time, you’ll use these functions to remove 
leading and trailing spaces. However, you can also use these functions to 
remove other characters such as zeros (0), periods (.), and so on. 

Conversely, you can use the LPAD and RPAD functions to add leading or 
trailing characters to the left or right side of a string. Again, most of the time, 
you'll use these functions to add leading and trailing spaces. However, you can 
also use these functions to add other characters. 

You can use the LOWER and UPPER functions to convert the characters in 
a string to lower or uppercase. In addition, you can use the INITCAP function to 
convert the characters in a string so the initial letter in each word is capitalized 
and the rest of the word is lowercase. 

You can use the NVL and NVL2 functions to provide a substitute string if 
the string value is a NULL value. For example, you can use these functions to 
display a value of “Unknown” instead of displaying the default value of null. 

You can use the last four functions presented in this figure to modify a 
string. To start, you can use the SUBSTR function to return the specified 
number of characters from anywhere in a string. When you use this function, 
you can use the second argument to specify the starting point where 1 is the first 
character in the string. In addition, you can use the third argument to specify the 
number of characters that you want to return. If you don’t specify this argument, 
this function will return all characters from the starting point to the end of the 
string. 

To specify the starting point and length arguments for the SUBSTR func- 
tion, it’s common to use the INSTR and LENGTH functions to return integer 
values. For example, if you want to find the position of the first space in a 
string, you can use an INSTR function to return an integer value for its position. 
Then, if necessary, you can perform arithmetic operations on this integer, and 
you can nest the resulting expression within a SUBSTR function. Similarly, you 
can use the LENGTH function to return the number of characters in a string, 
you can include this function in an arithmetic expression, and you can nest the 
expression within a SUBSTR function. 

Finally, you can use the REPLACE function to replace a substring within a 
string with another substring. For example, you might want to use this function 
to replace hypens (-) with periods (.). 


Chapter 8 How to work with data types and functions 


Some common character functions 


Descriptio 


LTRIM(string[, trim_string] ) 


RTRIM(string[, trim_string] ) 


TRIM([trim_char FROM ] string) 


LPAD (string, length[, pad_string] ) 


RPAD (string, length[, pad_string] ) 


LOWER (string) 
UPPER (string) 
INITCAP (string) 


NVL(string, value) 


NVL2 (string, valuel, value2) 


SUBSTR (string, start[, length] ) 


LENGTH (string) 


INSTR (string, find [,start] ) 


REPLACE (string, find, replace) 


By default, this function removes any spaces from the 
left side of the specified string. If you specify a trim 
string, this function removes all characters specified in 
the trim string. 

Same as LTRIM but removes characters from the right 
side of the string instead of the left. 

Removes any spaces from the left and right sides of the 
specified string. If you specify a trim character, this 
function removes the trim character from both sides of 
the string. 


Pads the left side of the string to the specified length 
with spaces or with the characters specified by the pad 
string. 

Pads the right side of the string to the specified length 
with spaces or with the characters specified by the pad 
string. 


Converts the string to lowercase letters. 
Converts the string to uppercase letters. 
Converts the initial letter in each word to uppercase. 


Returns the value argument if the specified string is a 
null value. Otherwise, this function returns the specified 
string. 

Returns the valuel argument if the specified string is a 
null value. Otherwise, this function returns the value2 
argument. 


Returns the specified number of characters (length) 
from the string (string) at the specified starting position 
(start). 


Returns an integer for the number of characters in the 
specified string. 

Returns an integer for the position of the first occur- 
rence of the specified find string in the specified string 
starting at the specified position. If the starting position 
isn’t specified, the search starts at the beginning of the 
string. If the string isn’t found, the function returns zero. 


Returns the string with all occurrences of the specified 
find string replaced with the specified replace string. 


Figure 8-10 | How to use the common character functions (part 1 of 2) 
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Part 2 of figure 8-10 shows examples of the string functions described in 
part 1. To start, the LTRIM and RTRIM examples remove spaces from the left 
and right sides of a string. Then, the TRIM example removes spaces from both 
sides of a string. Here, the result column uses single quotes to identify strings. 
This distinguishes string values from number values, and it shows the leading 
and trailing spaces for the result. 

The second LTRIM example shows how to remove other characters besides 
spaces from a string. Here, the second argument specifies that all dollar signs 
and zeros should be removed from the left side of the string. This shows that the 
second argument of the LTRIM and RTRIM functions allows you to trim 
multiple characters. 

Conversely, the second TRIM example uses a different syntax that only 
allows you to specify a single character. Here, you must specify that character, 
followed by the FROM keyword, followed by the string. 

The LPAD and RPAD examples show how to add spaces or other characters 
to the left or right side of a string. If you want, you can use these functions to 
align the columns of a result set. In particular, note how the LPAD function can 
be used to align numbers with the right side of a column. 

The LOWER, UPPER, and INITCAP examples show how to change the 
case of a string. Note how the INITCAP example works the same regardless of 
whether the input string is uppercase or lowercase. 

The NVL and NVL2 examples show how to substitute a string fora NULL 
value. In these examples, string literals are used to specify a string value, and 
the NULL keyword is used to specify a NULL value. 

The SUBSTR examples show how to return part of a string. To start, the 
first example returns the first five characters of a string. To do that, the second 
argument specifies 1 as the position for first character in the string, and the 
second argument specifies 5 as the length of the string. The second SUBSTR 
example works similarly, but it returns a string that begins at the seventh charac- 
ter and is three characters long. Unlike the first two examples, the third 
SUBSTR example doesn’t include a third argument. As a result, it returns all 
characters in the string from the seventh character to the end of the string. 

The INSTR examples show how to search for a string within a string and to 
return an integer value for its starting position. To start, the first example shows 
how to return an integer for the position of the first space in a string. In this 
case, the space is the sixth character of the string. Then, the second example 
shows how to return an integer for the first hyphen in the string. Next, the third 
example shows how to return an integer for the second hyphen in the string. To 
do that, this example uses the third argument of the INSTR function to start the 
search at the fifth character in the string. Finally, the fourth example shows how 
to return the position for a string literal of “1212”. This shows that you can use 
this function to search for a single character or a string of multiple characters. 

The LENGTH examples show how to return an integer for the length of a 
string. Note that this includes any leading or trailing spaces. 

The REPLACE examples show how to replace part of a string with another 
string. Here, the first example replaces all of the hyphens within the string with 
periods. Then, the second example replaces all of the hypens within the string 
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Character function examples 


LTRIM(' 
RTRIM(' 
TRIM(! 


John Smith ‘') 
John Smith ‘') 
John Smith ') 


LTRIM('$0019.99', '$0') 
TRIM('S' FROM '$0019.99') 


LPAD('$19.99', 15) 
LPAD('$2150.78', 15) 
LPAD('$2150.78', 15, '.') 
RPAD('John', 15) 
RPAD('John', 15, '.') 


LOWER('CA') ‘ca! 
UPPER('ca') 'CA! 
INITCAP ('john smith') 
INITCAP ('JOHN SMITH') 


NVL ('Fresh Corn Records', 
NVL (NULL, 


NVL2 (NULL, 'Known', 


SUBSTR(' (559) 555-1212', 
SUBSTR(' (559) 555-1212', 
SUBSTR(' (559) 555-1212', 


INSTR('(559) 555-1212', ! 


INSTR ('559-555-1212', '-!) 


INSTR ('559-555-1212', '-! 
INSTR ('559-555-1212', 


LENGTH (' (559) 555-1212') 
LENGTH(' (559) 555-1212 


‘Unknown Company Name') 


‘Unknown Company Name') 
NVL2 ('Fresh Corn Records', 


Known’, 'Unknown') 


'Unknown') 


1, 5) 
7, 3) 
7) 


') 


5) 


'1212') 


REPLACE ('559-555-1212', '-! 
REPLACE ('559-555-1212', '-!, 


Description 
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'John Smith I 
' John Smith! 
'John Smith! 


'19.99' 
10019.99! 


$19.99! 

$2150.78! 

Vee eee $2150.78! 
' John i 
"John. ..ccecccee! 


'John Smith! 
'John Smith' 


'Fresh Corn Records! 
‘Unknown Company Name’ 
'Known'! 

‘Unknown! 


'(559)! 
15551 
1555-1212' 


14 
18 


'559.555.1212! 
15595551212! 


e For more information about these and other functions, you can look them up in the 
Oracle Database SQL Reference manual. To do that, you can start by navigating to the 
“Functions” topic. Then, you can navigate to the function you want to learn more about. 


Figure 8-10 


How to use the common character functions (part 2 of 2) 
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with nothing. To do that, the third example uses two single quotes with nothing 
between them to specify a value of nothing. Although these examples use a 
string that contains a single character for the second and third arguments, you 
can specify a string that contains multiple characters for these arguments. 


How to parse a string 


Figure 8-11 shows how to parse a string. To start, the first SELECT state- 
ment shows how you can use the SUBSTR function to format columns in a 
result set. Here, the second column begins by getting the first name of the 
vendor contact and appending a space to the end of this name. Then, it uses the 
SUBSTR function to return the first initial of the last name for the vendor 
contract and appends this initial to the string. Finally, this column adds a period 
after the initial for the last name. 

The third column displays the vendor’s phone number without an area code. 
To accomplish that, this column specification uses the SUBSTR function to 
return all characters of the vendor_phone column starting at the seventh charac- 
ter. Of course, this only works correctly because all of the phone numbers are 
stored in the same format with the area code in parentheses. 

This SELECT statement also shows how you can use a function in the 
search condition of a WHERE clause. This condition uses the SUBSTRING 
function to select only those rows with an area code of 559. To do that, it 
retrieves three characters from the vendor_phone column starting with the 
second character. Again, this assumes that the phone numbers are all in the 
same format and that the area code is enclosed in parentheses. 

The second SELECT statement shows how to extract multiple values from a 
single column. In this example, both a first and a last name are stored in the 
Name column of the String_Sample table. As a result, if you want to work with 
the first and last names independently, you have to parse the string using the 
string functions. In this example, the first name is considered to be every non- 
blank character up to the first blank, and the last name is considered to be every 
character after the first blank. 

To extract the first name, this statement uses the SUBSTR and INSTR 
functions. First, it uses the INSTR function to locate the first space in the Name 
column. Then, it uses the SUBSTR function to extract all of the characters up to 
that space. Note that a value of one is subtracted from the value that’s returned 
by the INSTR function, so the space itself isn’t included in the first name. 

To extract the last name, this statement uses the same functions. First, it 
uses the INSTR function to locate the first space in the Name column. Then, it 
uses the SUBSTR function to extract all of the characters from that space to the 
end of the string. Note that a value of one is added to the value that’s returned 
by the INSTR function, so the space itself isn’t included in the last name. 

As you review this example, you should keep in mind that I kept it simple 
so that you can focus on how the string functions are used. You should realize, 
however, that this code won’t work for all names. If, for example, a first name 
contains a space, such as in the name Jean Paul, this code won’t work properly. 
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A SELECT statement that uses the SUBSTR function 


SELECT vendor name, 
vendor_contact _first_name || ' ' 
SUBSTR (vendor contact last name, 1, 1) || '.' 
AS contact_name, 
SUBSTR(vendor_phone, 7) AS phone 
FROM vendors 
WHERE SUBSTR(vendor phone, 2, 3) = '559! 
ORDER BY vendor_name 


The result set 


VENDOR_N&ME CONTACT_NAME | PHONE 
1 Abbey Office Furnishings Kyra F, 555-8300 
2 EFI Industries Erick K. 555-1551 


3 Bill Marvin Electric Ine Kattlln H. 555-5106 
4 Cal State Termite Demetrius H. 555-1534 
5 California Business Machines Andars R. 555-5570 


(34 rows selected) 
The String_Sample table 


Lizbeth Darien 
Darnell O'Sullivan 
Lance Plnos-Potter 
Jean Paul Renard 
Alisha yon Strump 


A SELECT statement that parses a string 


SELECT SUBSTR (name, 1, (INSTR(name, ' ') - 1)) AS first_name, 
SUBSTR(name, (INSTR(name, ' ') + 1)) AS last_name 
FROM string sample 


The result set 


FIRST_NAME LAST_NAME 
1 Lizbeth Darlan 
2 Darnell O'Sullivan 
3 Lance Plnos-Potter 
4 Jean Paul Renard 


5 Alisha yon Strump 


Description 
e When parsing strings, it’s common to nest one function within another. 


Figure 8-11 How to parse a string 
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That illustrates the importance of designing a database so that this type of 
problem doesn’t occur. You’ll learn more about that in section 3. For now, just 
realize that if a database is designed correctly, you won’t have to worry about 
this type of problem. 


How to sort a string in numerical sequence 


Figure 8-12 addresses a common problem that occurs when you store 
numeric data in a character column and then want to sort the column in numeric 
sequence. In this figure, the columns in the String_Sample table are defined 
with character data types. As a result, the first example sorts the values in the ID 
column in alphabetical sequence instead of numeric sequence, which isn’t what 
you want. 

One way to solve this problem is to convert the values in the ID column to 
integers for sorting purposes as shown in the SELECT statement in the second 
example. To do that, this example uses the TO_LNUMBER function to convert 
the ID column to a NUMBER value. As a result, this example sorts the rows in 
numeric sequence. 

Another way to solve this problem is to pad the numbers with leading zeros 
or spaces as shown in the third example. To do that, this example uses the LPAD 
function to pad the left side of the ID column with zeros. When it does, it 
specifies an alias for this column of lpad_id. Then, it sorts the result set by this 
alias, which causes the rows to be returned in numeric sequence. 


How to sort mixed-case columns in alphabetical 
sequence 


You may remember from chapter 3 that uppercase letters come before 
lowercase letters when you sort a string in ascending sequence. As a result, a 
vendor name like “ASC Signs” comes before “Abbey Office Furnishings.” 

Now that you know how to use the LOWER and UPPER functions, though, 
this problem is easy to fix. Just convert the strings to all uppercase or all lower- 
case before you sort them as in 

ORDER BY LOWER (vendor_name) 
or 

ORDER BY UPPER (vendor_name) 

Either way, the rows will be sorted in the correct alphabetical sequence. 
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A table sorted by a character column 


SELECT * FROM string sample 
ORDER BY id 


The result set 


pII NAME 
Lizbeth Darien 
Lance Pinos-Potter 


Darnell O'Sullivan 
Jaan Paul Renard 
Allsha yon Strump 


A table sorted by a character column treated as a numeric column 


SELECT * FROM string sample 
ORDER BY TO_NUMBER (id) 


The result set 


Lizbeth Darlen 
Darnell O'Sullivan 
Alisha yon Strump 
Lance Pinos-Potter 
Jean Paul Renard 


A table sorted by a character column that’s padded with leading zeros 


SELECT LPAD(id, 2, '0') AS lpad_id, name 
FROM string sample 
ORDER BY lpad_id 


The result set 


LPAD_ID |) NAME 
Lizbeth Darien 
Darnell O'Sullivan 


Allsha yon Strump 
Lance Pinos-Potter 


Jean Paul Renard 


Description 


e If you sort by a character column that contains numbers, you may receive unexpected 
results. To avoid that, you can convert the string column to a numeric value or you can 
pad the left side of the character column with leading zeros or spaces. 


Figure 8-12 How to sort a string in numerical sequence 
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How to work with numeric data 


In addition to the character functions, Oracle provides several functions for 
working with numeric data. Although you’ll probably use only a couple of these 
functions regularly, you should be aware of them in case you ever need them. 


How to use the common numeric functions 


Figure 8-13 summarizes some of the common numeric functions that Oracle 
provides. The function you’ll probably use most often is the ROUND function. 
This function rounds a number to the precision specified by the length argu- 
ment. Note that you can round the digits to the left of the decimal point by 
coding a negative value for this argument. However, you’re more likely to code 
a positive number to round the digits to the right of the decimal point. 

Another function that you might use regularly is the TRUNC function. This 
function works like the ROUND function, but it truncates the number instead of 
rounding to the nearest number. In other words, this function chops off the end 
of the number without doing any rounding. For example, if you round 19.99 to 
the nearest integer, you get a value of 20. However, if you truncate 19.99, you 
get a value of 19. 

If you study the examples in this figure, you shouldn’t have much trouble 
understanding how they work. For instance, you can easily see how a negative 
length argument causes the ROUND and TRUNC functions to round and 
truncate to the left of the decimal point. 

In addition to the functions shown in this figure, Oracle provides many 
other functions for performing mathematical calculations. In particular, it 
provides many functions for performing trigonometric calculations. Since 
you’re not likely to use these functions for business applications, they aren’t 
presented in this book. However, if you need a function that isn’t shown here, 
you can search for the function in the Oracle Database SQL Reference manual. 
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Some common numeric functions 


Description 


Returns the number rounded to the precision specified by the 
length argument. If the length argument is positive, the digits 
to the right of the decimal point are rounded. If it’s negative, 
the digits to the left of the decimal point are rounded. 

Returns the number truncated as specified by the length 
argument. If the length argument is omitted, this function 
truncates all numbers to the right of the decimal point. If it’s 
negative, this function truncates the specified number of digits 
to the left of the decimal point. 


Returns the smallest integer that is greater than or equal to 


ROUND (number [, 


TRUNC (number [, 


CEIL (number) 


FLOOR (number) 


ABS (number) 
SIGN (number) 


MOD(number, number divisor) 


POWER (number, number _exponent) 


SQRT (number) 


length] ) 


length] ) 


number, 
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Returns the largest integer that is less than or equal to 


number. 


Returns the absolute value of number. 


Returns -1 if the number is less than 0, 0 if the number is 
equal to 0, and 1 if the number is greater than 0. 


Returns the remainder after the number argument is divided 


by the divisor. 


Returns the number after it has been raised to the power of 


the specified exponent. 


Returns the square root of the number. 


Examples that use the numeric functions 


ROUND (12.5) 

ROUND (12.4999, 
ROUND (12.4999, 
ROUND (12.4944, 
ROUND (1264.99, 


TRUNC (12.5) 

TRUNC (12.4999, 
TRUNC (12.4944, 
TRUNC (1264.99, 


CEIL (1.25) 

CEIL(-1.25) 
FLOOR (1.25) 
FLOOR (-1.25) 


Note 


ABS (1.25) 
ABS (-1.25) 


SIGN(1.25) 
SIGN (0) 
SIGN (-1.25) 


MOD(10, 10) 
MOD(10, 9) 


POWER(2, 2) 
POWER(2, 2.5) 


SQRT (4) 
SQRT (5) 


4 
5.65685... 


2 
2.23606... 


e ‘You can use these functions to work with any numeric data type or any nonnumeric type 
that can be implicitly converted to a numeric type. 


Figure 8-13 


How to use the common numeric functions 
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How to search for floating-point numbers 


Earlier in this chapter, you learned that floating-point numbers don’t always 
contain exact values. From a practical point of view, though, that means that 
you don’t want to search for an exact value when you’re working with floating- 
point numbers. If you do, you’ll miss values that are essentially equal to the 
value you want. 

To illustrate, consider the Float_Sample table shown in figure 8-14. This 
table includes a column named float_value that’s defined with the 
BINARY_DOUBLE data type. Here, the first example shows what happens 
when a SELECT statement retrieves all rows where the value stored in the 
float_value column is equal to 1. As you can see, the result set includes only the 
second row, even though the first and third rows also contain a value that’s 
approximately equal to 1. 

To solve this problem, you can search for an approximate value when you 
perform a search on a column with a floating-point data type. To do that, you 
can search for a range of values as shown by the second example in this figure. 
This SELECT statement searches for values between .99 and 1.01. 

Another alternative is to round the value that you’re searching for. This is 
illustrated by the third example. In both the second and third examples, the 
statement returns the three rows in the Float_Sample table that have a float 
value that’s approximately equal to 1. 
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The Float_Sample table 


FLOAT_ID [FLOAT VALUE 
1 0.999999999999999 
21.0 
31 .p00000000000001 


4 1234.5676901 2345 
5 999.04440209348 
6 24.04849 


A SELECT statement that searches for an exact value 


SELECT * FROM float _sample 
WHERE float value = 1 


The result set 


È FLOAT D |FLOAT_VALUE 
1 21.0 


A SELECT statement that searches for a range of values 


SELECT * FROM float_sample 
WHERE float_value BETWEEN 0.99 AND 1.01 


The result set 


FLOAT D |FLOAT_VALUE 
1 0.999995939939999 


21.0 
31 .0o0000000000001 


A SELECT statement that searches for rounded values 
SELECT * FROM float_sample 


WHERE ROUND (float value, 2) = 1 
The result set 


4) FLOAT_ID |FLOAT_YALUE 
1 0.999993599999999 


21.0 
31 .000000000000001 


Description 


e Because the values of floating-point numbers aren’t always exact, you’ll want to search 
for approximate values when you retrieve floating-point numbers. To do that, you can 
specify a range of values, or you can use the ROUND function to search for rounded 


values. 


Figure 8-14 How to search for floating-point numbers 
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How to work with date/time data 


In the topics that follow, you’ll learn how to use some of the functions that 
Oracle provides for working with dates and times. As you’ll see, these include 
functions for extracting different parts of a date/time value and for performing 
operations on dates and times. 


How to use the common date/time functions 


Figure 8-15 shows how the date/time functions work and how the plus and 
minus signs can be used to work with dates. If you study the summaries and 
examples, you shouldn’t have much trouble using these functions. 

To get the current date and time, you can use the SYSDATE or 
CURRENT_DATE function, although the SYSDATE function is more com- 
monly used. In most cases, both of these functions return the same value. 
However, if a session time zone has been set, the value returned by the 
CURRENT_DATE function will be adjusted to accommodate that time zone. 

When you use the ROUND and TRUNC functions without a date format, 
the default is to round or truncate to a date at 00:00:00 on a 24-hour clock. But 
when you specify a date format like MI for minutes, the date/time value is 
rounded or truncated to that time component. 

When you use the MONTHS_BETWEEN function, it returns a whole 
number of months when the day component is the same for both date argu- 
ments. However, if the day component isn’t the same for both date arguments, 
this function returns a decimal number. 

When you use the ADD_MONTHS function, you can specify a positive 
value to add months to the specified date or a negative number to subtract 
months from the specified date. If necessary, the function will increase or 
decrease the year component. 

When you use the NEXT_DAY function, you specify a day of the week as 
the second argument. Then, the function returns the next day of the week that 
comes after the specified date. For instance, the first example of this function 
returns the first Friday after August 15, 2008, which is August 22, and the 
second example returns the first Thursday after August 15, 2008, which is 
August 21. Note that you can use full or abbreviated names for the second 
argument. 

When you use the addition (+) and subtraction (-) operators, you can add a 
specific number of days to a date or subtract a specific number of days from a 
date. You can also use the subtraction operator to subtract one date from an- 
other. If the first date comes after the second date, a positive integer is returned. 
Otherwise, a negative integer is returned. 
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Some common date/time functions 


SYSDATE Returns the current local date and time based on the 
operating system’s clock. 

CURRENT_DATE Returns the local date and time adjusted for the current 
session time zone. 


ROUND (date[, date format] ) Returns the date rounded to the unit specified by the date 
format. If the format is omitted, rounds to the nearest day. 


TRUNC (date[, date format] ) Works like the ROUND function but truncates the date. 


MONTHS BETWEEN (datel, date2) Returns the number of months between date1 and date2. 


ADD _MONTHS(date, integer_months) Adds the specified number of months to the specified date 
and returns the resulting date. 


LAST_DAY (date) Returns the date for last day of the month for the specified 
date. 


NEXT_DAY (date, day of week) Returns the date for the next day of the week that comes 
after the specified date. 


Two operators for working with dates 
Operator Description 
Adds the specified number of days to a date. 


Subtracts a specified number of days from a date. Or, subtracts one date from another and 
returns the number of days between the two dates. 


Examples that use the date/time functions 


SYSDATE 19-AUG-08 04:20:36 PM 
ROUND (SYSDATE) 20-AUG-08 12:00:00 AM 
TRUNC (SYSDATE, 'MI') 19-AUG-08 04:20:00 PM 


MONTHS BETWEEN ('01-SEP-08','01-AUG-08') 1 

MONTHS BETWEEN ('15-SEP-08', '01-AUG-08') 1.4516129032258064... 
ADD_MONTHS ('19-AUG-08', -=1) 19-JUL-08 

ADD_MONTHS ('19-AUG-08', 11) 19-JUL-09 


LAST_DAY ('15-FEB-08') 29-FEB-08 
NEXT_DAY('15-AUG-08', 'FRIDAY') 22-AUG-08 
NEXT_DAY('15-AUG-08', 'THURS') 21-AUG-08 


SYSDATE - 1 18-AUG-08 
SYSDATE + 7 26-AUG-08 
SYSDATE - TO_DATE('01-JAN-08') 231 

TO DATE('O1-JAN-08') - SYSDATE -231 


Figure 8-15 How to use the common date/time functions 
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How to parse dates and times 


Figure 8-16 shows you how to use the TO_CHAR function to return the 
various parts of a DATE value as a string. To do that, you specify the appropri- 
ate date format element for the part of the DATE value that you want to return. 
For example, the “MONTH” element returns a string that spells out the month 
in uppercase, the “Month” element spells out the month with an initial cap, and 
the “MM” element returns a string that contains the number for the month. 

If you need to get an integer value for part of the date, you can use the 
TO_CHAR function with the appropriate date format element to return a string 
that contains numeric characters. Then, you can use the TO_LNUMBER function 
to convert that string to an integer value. For example, you can use the 
TO_CHAR function with the “MM” element to return a string that contains the 
the number for the month. Then, you can use the TOLNUMBER function to 
convert the string that’s returned to an integer value. 
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Examples that parse a date/time value 


TO_CHAR(SYSDATE, 'DD-MON-RR HH:MI:SS') 19-AUG-08 04:20:36 PM 


TO CHAR(SYSDATE, 'YEAR') TWO THOUSAND EIGHT 
TO_CHAR(SYSDATE, 'YEAR') Two Thousand Eight 
TO CHAR(SYSDATE, 'YYYY') 2008 

TO CHAR(SYSDATE, 'YY') 08 


TO CHAR(SYSDATE, '"MONTH') AUGUST 
TO CHAR(SYSDATE, 'MON') AUG 
TO_CHAR(SYSDATE, 'MM') 08 


TO _CHAR(SYSDATE, 'DD') 19 
TO_CHAR(SYSDATE, 'DAY') TUESDAY 
TO_CHAR(SYSDATE, 'DY') TUES 


TO_CHAR(SYSDATE, 'HH24') 16 
TO_CHAR(SYSDATE, 'HH') 04 
TO CHAR(SYSDATE, 'MI') 20 
TO CHAR(SYSDATE, 'SS') 


TO CHAR(SYSDATE, 'CC') 
TO CHAR(SYSDATE, 'Q') 
TO CHAR(SYSDATE, 'WW') 
TO CHAR(SYSDATE, 'W') 
TO _CHAR(SYSDATE, 'DDD') 
TO CHAR(SYSDATE, 'D') 


Example 

TO NUMBER(TO CHAR(SYSDATE, 'HH24')) 
TO NUMBER(TO CHAR(SYSDATE, 'HH')) 
TO NUMBER(TO CHAR(SYSDATE, 'SS')) 


Description 
e You can use the TO_CHAR method to retrieve any part of a date as a string. 


e Ifyou need to perform mathematical operations on a part of a date, you can use the 
TO_NUMBER function to convert any numeric part of a date to a NUMBER value. 


Figure 8-16 How to parse dates and times 
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How to perform a date search 


Because date/time values always contain both a date and a time component, 
you may need to ignore the time component when you search for a date value. 
In figure 8-17, for example the Date_Sample table has a date_id column defined 
with the NUMBER type and a start_date column defined with the DATE type. 
In this table, the time components in the first three rows have a zero value. In 
contrast, the time components in the next three rows have non-zero time compo- 
nents. 

The first SELECT statement shows a typical problem that you can encoun- 
ter when searching for dates. Here, the statement searches for rows in the 
Date_Sample table that have a date value of “‘28-FEB-06”. Since a time compo- 
nent isn’t specified by the date literal in the WHERE clause, a default time 
component of 00:00:00 is added to the date when it is converted to a DATE 
value. However, because the one row with this date has a time that isn’t 
00:00:00, no rows are returned by this statement. 

To solve this problem, you can search for a range of dates that includes only 
the dates you’re looking for as shown by the second SELECT statement in this 
figure. In this statement, the search is for any date greater than or equal to 
February 28, 2006, and less than March 1, 2006. As a result, this search will 
find any date on February 28, no matter what the time component is. 

Another alternative is to use the TRUNC function to truncate the time 
component from the date/time values as shown in the third SELECT statement 
in this figure. In this statement, the time components of the start dates are set to 
00:00:00, which is the same as the time component for the date literal. 

A third approach to this problem is to extract the month, day, and year 
components of each date in the WHERE clause using the techniques in the 
previous figure. Then, the condition in the WHERE clause can search for the 
right month and day and year values. Usually, though, you won’t need this 
technique because the techniques in this figure are easier to use. 


Chapter 8 How to work with data types and functions 


The Date_Sample table 


START_DATE 
1 01-M4R-1979 00:00:00 
2 28-FEB-1999 00:00:00 
3 31-OCT-2003 00:00:00 


4 28-FEB-2005 10:00:00 
5 28-FEB-2006 13:56:32 
6 01-M4R-2006 09:02:25 


A SELECT statement that fails to return a row 


SELECT * FROM date sample 
WHERE start_date = '28-FEB-06' 


A SELECT statement that searches for a range of dates 


SELECT * FROM date_sample 
WHERE start date >= '28-FEB-06' AND start_date < '01-MAR-06' 


The result set 


W DATED | START_DaTE 
1 5 28-FEB-06 


A SELECT statement that uses the TRUNC function to remove time values 


SELECT * FROM date _ sample 
WHERE TRUNC (start_date) = '28-FEB-06' 


The result set 


W DATED | ST4RT_DATE 
1 5 28-FEB-06 


Description 


e Ifyou perform a search using a date string that doesn’t include the time, the date string is 
converted implicitly to a date/time value of 12:00:00 AM (midnight) or 00:00:00 on a 
24-hour clock. Then, if the date columns you’re searching have time values other than 
12:00:00 AM, you have to accommodate the times in the search condition. 

e You can accommodate non-zero time components by searching for a range of dates 
rather than specific dates or by using the TRUNC function to remove the time component. 


Figure 8-17 How to perform a date search 
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How to perform a time search 


When you search for a time value without specifying a date component, 
Oracle automatically uses a default date of January 1 for the current year. That’s 
why the first SELECT statement in figure 8-18 doesn’t return any rows. In other 
words, even though one row has the correct time value in its search condition, 
that row doesn’t have the correct date value. 

The second SELECT statement in this figure shows how you can solve this 
problem. Here, the search condition in this statement uses the TO_CHAR 
function to convert the date/time values in the start_date column to string values 
that don’t contain a date component. To do that, it uses a date format element of 
“HH24:MI:SS”. That way, the string that’s returned will match the string literal 
for the time that’s specified in the search condition. 

Like the first SELECT statement, the third SELECT statement doesn’t 
return any rows. The problem, of course, is the same as in the first statement. To 
solve this problem, you can use the TO_CHAR function to remove the date 
component as shown in the fourth SELECT statement. Once you do that, this 
SELECT statement will return all rows that have a time within the range that’s 
specified in the search condition. 
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The Date_Sample table 


START_DATE 
1 O1-M4R-1979 00:00:00 
2 28-FEB-1999 00:00:00 
3 31-OCT-2003 00:00:00 


4 28-FEB-2005 10:00:00 
5 28-FEB-2006 13:56:32 
6 01-M4R-2006 09:02:25 


A SELECT statement that fails to return a row 


SELECT * FROM date sample 
WHERE start_date = TO_DATE('10:00:00', 'HH24:MI:88') 


A SELECT statement that ignores the date component 


SELECT * FROM date_sample 
WHERE TO_CHAR(start_date, 'HH24:MI:S8') = '10:00:00' 


The result set 


W osteo | START_DATE 
1 4 28-FEB-05 


Another SELECT statement that fails to return a row 


SELECT * FROM date_sample 
WHERE start_date >= TO DATE('09:00:00', 'HH24:MI:88') 
AND start_date «< TO_DATE('12:59:59', 'HH24:MI:S8') 


Another SELECT statement that ignores the date component 


SELECT * FROM date sample 
WHERE TO_CHAR(start_date, 'HH24:MI:88') >= '09:00:00' 
AND TO CHAR(start date, 'HH24:MI:S8') < '12:59:59! 


The result set 


W DATED | START _DATE 
1 4 28-FEB-05 
2 6 01-MAR-08 


Fil 


Description 


e If you use the TO_DATE function to return a date that only contains a time component, 
the date is converted implicitly to the first day of the current year. For 2006, for example, 
the date is converted to January 1, 2006. Then, if the date columns you’re searching have 
dates other than that date, you have to accommodate those dates in the search condition. 


e To ignore the date component of a date/time value, you can convert the DATE type to a 
CHAR type and only return the part of the time value that you want to use. 


Figure 8-18 How to perform a time search 
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Other functions you should know about 


In addition to the functions presented so far in this chapter, Oracle provides 
some other general purpose functions that you should know about. 


How to use the CASE function 


Figure 8-19 presents the two formats of the CASE function. This function 
returns a value that’s determined by the conditions you specify. The two ex- 
amples in this figure show how this function works. 

The first example uses a simple CASE function. When you use this func- 
tion, Oracle compares the input expression you code in the CASE clause with 
the expressions you code in the WHEN clauses. In this example, the input 
expression is the value in the terms_id column of the Invoices table, and the 
WHEN expressions are the valid values for this column. When Oracle finds a 
WHEN expression that’s equal to the input expression, it returns the expression 
specified in the matching THEN clause. If the value of the terms_id column is 
3, for example, this function returns the value “Net due 30 days.” Although it’s 
not shown in this example, you can also code an ELSE clause at the end of the 
CASE function. Then, if none of the when expressions are equal to the input 
expression, the function returns the value specified in the ELSE clause. 

The simple CASE function is typically used with columns that can contain a 
limited number of values, such as the terms_id column used in this example. In 
contrast, the searched CASE function can be used for a wide variety of pur- 
poses. For example, you can test for conditions other than equal with this 
function. In addition, each condition can be based on a different column or 
expression. The second example in this figure shows how this function works. 

This example determines the status of the invoices in the Invoices table. To 
do that, the searched CASE function determines the number of days between 
the current date and the invoice due date. If the difference is greater than 30, the 
CASE function returns the value “Over 30 days past due.” Similarly, if the 
difference is greater than 0, the function returns the value “1 to 30 days past 
due.” Note that if an invoice is 45 days old, both of these conditions are true. In 
that case, the function returns the expression associated with the first condition 
since this condition is evaluated first. In other words, the sequence of the 
conditions is critical to getting logical results. If neither of the conditions is true, 
the function uses the ELSE clause to return a value “Current.” 
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The syntax of the simple CASE expression 


CASE input_expression 
WHEN when_expression_1 THEN result_expression_1 
[WHEN when expression 2 THEN result_expression_2]... 
[ELSE else_result_expression] 

END 


The syntax of the searched CASE expression 


CASE 
WHEN conditional _expression_1 THEN result_expression_1 
[WHEN conditional expression_2 THEN result_expression 2]... 
[ELSE else_result_expression] 

END 


A SELECT statement that uses a simple CASE expression 


SELECT invoice number, terms id, 
CASE terms_id 
WHEN 1 THEN 'Net due 10 days’ 
WHEN 2 THEN 'Net due 20 days’ 
WHEN 3 THEN 'Net due 30 days’ 
WHEN 4 THEN 'Net due 60 days’ 
WHEN 5 THEN 'Net due 90 days’ 
END AS terms 
FROM invoices 


The result set 


 INvoICE_NUMBER |] TERMS_ID |TERMS 
1 QPS8872 4 Nat due 60 days 


2 9545443 4 Net due 60 days 
3 P-O608 5 Net due 90 days 


A SELECT statement that uses a searched CASE expression 


SELECT invoice number, invoice total, invoice date, invoice due date, 
CASE 
WHEN (SYSDATE - invoice due date) > 30 THEN 'Over 30 days past due! 
WHEN (SYSDATE - invoice due date) > 0 THEN '1 to 30 days past due! 
ELSE 'Current' 
END AS status 
FROM invoices 
WHERE invoice total - payment total - credit _total > 0 


The result set 


INVOICE_NUMBER |A INVOICE_TOTAL INVOICE_DATE | INvOICE_DUE_paTE | status 
37 547480102 224 19-MAY-08 24-JUN-08 1 to 30 days past due 


38 547481328 224 20-MAY-08 25-JUN-08 1 to 30 days past dua 
39 40316 21842 18-JUL-08 20-JUL-08 Current 
40 31361833 579.42 23-MAY-08 03-JUN-08 Over 30 days past due 


Note 
e The last example assumes the SYSDATE function returns July 18, 2008. 


Figure 8-19 How to use the CASE expression 
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How to use the COALESCE, NVL, and NVL2 
functions 


Figure 8-20 presents three functions that you can use to substitute non-null 
values for null values: COALESCE, NVL, and NVL2. Although these functions 
are similar, COALESCE is the most flexible because it lets you specify a list of 
values. Then, it returns the first non-null value in the list. In contrast, the NVL 
and NVL2 functions aren’t as flexible since they only let you substitute a single 
non-null value for a null value. Still, these functions are useful in some situa- 
tions. 

The first example uses the COALESCE function to return the value of the 
payment_date column if that column doesn’t contain a null value. Otherwise, it 
returns the value of the invoice_due_date column if that column doesn’t contain 
a null value. Otherwise, it returns a date of January 1, 1900. Note that all of the 
arguments for this function must be of the same date type. In this case, all three 
arguments are of the DATE type. 

The second example performs a similar task using the NVL function. In this 
example, the NVL function substitutes a string value of “Unpaid” for a null 
payment date. Since both arguments must be of the same data type, this ex- 
ample uses the TO_CHAR function to convert the payment_date column to a 
string. 

The third example performs a similar task using the NVL2 function. In this 
example, the NVL2 function substitutes a string value of “Unpaid” for a null 
payment date, and it substitutes a string value of “Paid” for a non-null payment 
date. With the NVL2 function, only the second and third arguments need to be 
of the same data type. As a result, in this example, it isn’t necessary to convert 
the first argument to a string. 
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The syntax of the COALESCE, NVL, and NVL2 functions 


COALESCE (expressionl [, expression2][, expression3]...) 
NVL (expression, null_replacement) 
NVL2 (expression, not _null replacement, null replacement) 


A SELECT statement that uses the COALESCE function 


SELECT payment_date, invoice _due_date, 
COALESCE (payment date, invoice due date, TO DATE('01-JAN-1900')) 
AS payment_date_2 

FROM invoices 


The result set 


PAYMENT _DATE |[ INVOICE DUE Date |Ñ PAYMENT_DATE_2 
4 11-APR-08 22-APR-08 11-APR-08 
2 1414-MAY-06 23-MAY -08 14-May-08 
3 (null) 30-JUN-08 30-JUN-08 


4 12-May-06 1868-MAY-06 12-May-08 
5 13-May-06 18-MAY -06 13-May-08 
6 (null) 26-JUN-08 28-JUN-08 


A SELECT statement that uses the NVL function 
SELECT payment date, 
NVL (TO _CHAR(payment date), 'Unpaid') AS payment date 2 
FROM invoices 


The result set 


PAYMENT_DATE |Ñ PAYMENT_DATE_2 
1 11-APR-08 11-@PR-08 
2 14-MAY-08 14-May-08 


3 (null) Unpaid 
4 12-MA¥-08 12-MA¥-08 


A SELECT statement that uses the NVL2 function 
SELECT payment date, 
NVL2 (payment _date, 'Paid', 'Unpaid') AS payment _ date 2 
FROM invoices 


The result set 


PAYMENT DATE |} PAYMENT _DATE_2 
1 11-4PR-08 
2 14-May-08 


3 (nul) 
4 12-MAY-08 


Description 


e The COALESCE, NVL, and NVL2 functions let you substitute non-null values for null 
values. 


e The COALESCE function returns the first expression in a list of expressions that isn’t 
null. All of the expressions in the list must have the same data type. If all of the expres- 
sions are null, this function returns a null value. 


Figure 8-20 How to use the COALESCE, NVL, and NVL2 functions 
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How to use the GROUPING function 


In chapter 5, you learned how to use the ROLLUP and CUBE operators to 
add summary rows to a summary query. When you do that, a null value is 
assigned to any column in a summary row that isn’t being summarized. 

If you want to assign a value other than null to these columns, you can do 
that by using the GROUPING function, as illustrated in figure 8-21. This 
function accepts the name of a column as its argument. The column you specify 
must be one of the columns named in a GROUP BY clause that includes the 
ROLLUP or CUBE operator. 

The example in this figure shows how you can use the GROUPING func- 
tion in a summary query that summarizes vendors by state and city. This is the 
same summary query that was described in chapter 5. However, instead of 
simply retrieving the values of the vendor_state and vendor_city columns from 
the base table, this query uses the GROUPING function within a CASE function 
to determine the values that are assigned to those columns. If a row is added to 
summarize the vendor_state column, for example, the value of the GROUPING 
function for that column is 1. Then, the CASE function assigns a literal string 
value of “========” to that column. Otherwise, it retrieves the value of the 
column from the Vendors table. Similarly, if a row is added to summarize the 
vendor_city column, a literal string value of “========” is assigned to that 
column. As you can see in the result set shown here, this makes it more obvious 
which columns are being summarized. 

This technique is particularly useful if the columns you’re summarizing can 
contain null values. In that case, it would be difficult to determine which rows 
are summary rows and which rows simply contain null values. Then, you may 
not only want to use the GROUPING function to replace the null values in 
summary rows, but you may want to use the COALESCE or NVL function to 
replace null values retrieved from the base table. 
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The syntax of the GROUPING function 


GROUPING (column _name) 


A summary query that uses the GROUPING function 


SELECT 
CASE 
WHEN GROUPING (vendor state) = 1 THEN 's=s=s==s===' 
ELSE vendor_state 
END AS vendor state, 
CASE 
WHEN GROUPING (vendor city) = 1 THEN !==========' 
ELSE vendor_city 
END AS vendor city, 
COUNT (*) AS qty vendors 
FROM vendors 
WHERE vendor state IN ('IA', 'NdJ') 
GROUP BY ROLLUP (vendor state, vendor city) 
ORDER BY vendor_state DESC, vendor city DESC 


The result set 


D venvor_st... | venpor_crry |B aTY_ VENDORS 
Washington 
Fairfield 
East Brunswick 


Washington 
Falrflald 


1 
1 
2 
4 
1 
1 
2 
6 


Description 


You can use the GROUPING function to determine when a null value is assigned to a 
column as the result of the ROLLUP or CUBE operator. The column you name in this 
function must be one of the columns named in the GROUP BY clause. 

If a null value is assigned to the specified column as the result of the ROLLUP or CUBE 
operator, the GROUPING function returns a value of 1. Otherwise, the GROUPING 
function returns a value of 0. 

You typically use the GROUPING function with the CASE expression. Then, if the 
GROUPING function returns a value of 1, you can assign a value other than null to the 
column. 


Figure 8-21 How to use the GROUPING function 
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How to use the ranking functions 


Figure 8-22 shows how to use the Oracle ranking functions. These functions 
provide a variety of ways that you can rank the rows that are returned by a result 
set. All four of these functions have a similar syntax and work similarly. 

The first example shows how to use the ROW_NUMBER function. Here, 
the SELECT statement retrieves two columns from the Vendors table. The first 
column uses the ROW_NUMBER function to sort the result set by 
vendor_name and to number each row in the result set. To show that the first 
column has been sorted and numbered correctly, the second column displays the 
vendor_name. 

To accomplish the sorting and numbering, you code the name of the 
ROW_NUMBER function, followed by a set of parentheses, followed by the 
OVER keyword and a second set of parentheses. Within the second set of 
parentheses, you code the required ORDER BY clause that specifies the sort 
order. In this example, for instance, the ORDER BY clause sorts by 
vendor_name in ascending order. However, you can code more complex OR- 
DER BY clauses if necessary. In addition, you can code an ORDER BY clause 
that applies to the entire result set. In that case, the ORDER BY clause within 
the ranking function is used to number the rows and the ORDER BY clause 
outside the ranking function is used to sort the rows after the numbering has 
been applied. 

The second example shows how to use the optional PARTITION BY clause 
of a ranking function. This clause allows you to specify a column that’s used to 
divide the result set into groups. In this example, for instance, the PARTITION 
BY clause uses a column within the Vendors table to group vendors by state and 
to sort these vendors by name within each state. 

However, you can also use the PARTITION BY clause when a SELECT 
statement joins one or more tables, like this: 

SELECT vendor name, invoice number, 

ROW NUMBER{) OVER (PARTITION BY vendor_name 
ORDER BY invoice number) AS row_number 
FROM vendors JOIN invoices 
ON vendors.vendor_id = invoices.vendor id 
Here, the invoices will be grouped by vendor and sorted within each vendor by 
invoice number. As a result, if a vendor has three invoices, these invoices will be 
sorted by invoice number and numbered from 1 to 3. 
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The syntax for the four ranking functions 


ROW_NUMBER () OVER ([partition_ by clause] order _by clause) 
RANK () OVER ([partition_by clause] order_by clause) 
DENSE_RANK () OVER ([partition_by clause] order_by clause) 


NTILE (integer_expression) OVER ([partition_by clause] order_by clause) 


A query that uses the ROW_NUMBER function 


SELECT ROW_NUMBER() OVER(ORDER BY vendor_name) AS row_number, vendor_name 
FROM vendors 


The result set 


H ROW NUMBER |È VENDOR MAME 
1 ASC Signs 


2 AT&T 


3 Abbey Office Furnishings 
4 American Booksellers Assoc 


5 American Express 


A query that uses the PARTITION BY clause 


SELECT ROW NUMBER () 
OVER (PARTITION BY vendor_state ORDER BY vendor_name) 
AS row_number, vendor_name, vendor state 

FROM vendors 


The result set 


ROW NUMBER VENDOR_NAME YENDOR_STATE 
1 AT&T 
2 Computer Library 
3 Walls Fargo Bank 


1 ASC Signs 
2 Abbey Office Furnishings 
3 American Express 


Description 
e The ROW_NUMBER, RANK, DENSE_RANK, and NTILE functions are known as 
ranking functions. 


e The ROW_NUMBER function returns the sequential number of a row within a partition 
of a result set, starting at 1 for the first row in each partition. 

e The ORDER BY clause of a ranking function specifies the sort order in which the 
ranking function is applied. 


e The optional PARTITION BY clause of a ranking function specifies the column that’s 
used to divide the result set into groups. 


Figure 8-22 How to use the ranking functions (part 1 of 2) 
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The third example shows how the RANK and DENSE_RANK functions 
work. You can use these functions to rank the rows in a result set. In this ex- 
ample, both the RANK and the DENSE_RANK functions sort all invoices in 
the Invoices table by the invoice total. Since the first three rows have the same 
invoice total, both of these functions give these three rows the same rank, 1. 
However, the fourth row has a different value. To calculate the value for this 
row, the RANK function adds 1 to the total number of previous rows. In other 
words, since the first three rows are tied for first place, the fourth row gets 
fourth place and is assigned a rank of 4. 

The DENSE_RANK function, on the other hand, calculates the value for 
the fourth row by adding 1 to the rank for the previous row. As a result, this 
function assigns a rank of 2 to the fourth row. In other words, since the first 
three rows are tied for first place, the fourth row gets second place. 

The fourth example shows how the NTILE function works. You can use this 
function to divide the rows in a partition into the specified number of groups. 
When the rows can be evenly divided into groups, this function is easy to 
understand. For example, if a result set returns 100 rows, you can use the 
NTILE function to divide this result set into 10 groups of 10. However, when 
the rows can’t be evenly divided into groups, this function is a little more 
difficult to understand. In this figure, for example, the NTILE function is used 
to divide a result set that contains 5 rows. Here, the first NTILE function divides 
this result into 2 groups with the first having 3 rows and the second having 2 
rows. The second NTILE function divides this result set into 3 groups with the 
first having 2 rows, the second having 2 rows, and the third having 1 row. And 
so on. Although this doesn’t result in groups with even numbers of rows, the 
NTILE function creates the number of groups specified by its argument. 

In this figure, the examples for the RANK, DENSE_RANK, and NTILE 
functions don’t include PARTITION BY clauses. As a result, these functions are 
applied to the entire result set. However, whenever necessary, you can use the 
PARTITION BY clause to divide the result set into groups just as shown in the 
second example for the ROW_NUMBER function. 
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A query that uses the RANK and DENSE_RANK functions 


SELECT RANK() OVER (ORDER BY invoice total) AS rank, 
DENSE_RANK() OVER (ORDER BY invoice _total) AS dense_rank, 
invoice total, invoice number 

FROM invoices 


The result set 


RANK E pense Rank | INVOICE_TOTAL F INVOICE NUMBER 
6 25022117 
6 24863706 
6 24780512 
9.95 21-4748363 
9.95 21-4923721 
104-342-6068 


Ww v y a a 


1 
1 
1 
4 
4 
6 


aon fF ù y > 


Description 


e The RANK and DENSE_RANK functions both return the rank of each row within the 
partition of a result set. 


e Ifthere is a tie, both of these functions give the same rank to all rows that are tied. 


e To determine the rank for the next distinct row, the RANK function adds 1 to the total 
number of rows, while the DENSE_RANK function adds 1 to the rank for the previous 
row. 


A query that uses the NTILE function 


SELECT terms description, 
NTILE(2) OVER (ORDER BY terms_id) AS tile2, 
NTILE(3) OVER (ORDER BY terms id) AS tile3, 
NTILE(4) OVER (ORDER BY terms_id) AS tiled 
FROM terms 


The result set 


TERMS DESCRIPTION | Tuez tes ð Tes | 
1 Net due 10 days 1 1 

2 Net due 20 days 1 
3 Net due 30 days 1 
4 Net due 60 days 2 
5 Net due 90 days 2 


1 
1 1 
2 2 
2 3 
3 4 


Description 
e The NTILE function divides the rows in a partition into the specified number of groups. 


e Ifthe rows can’t be evenly divided into groups, the later groups may have one less row 
than the earlier groups. 


Figure 8-22 How to use the ranking functions (part 2 of 2) 
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Perspective 


In this chapter, you learned about the most common Oracle data types. You 
also learned how to use the common functions for working with strings, numbers, 
and dates. At this point, you have the essential skills you need to write SQL 
statements at a professional level. 

However, there’s more to learn about Oracle data types and functions. In 
particular, chapter 17 shows how to work with other temporal types, and chapter 
18 shows how to work with the large object (LOB) types. From that point on, you 
should be able to use the Oracle Database SQL Reference manual to learn about 
the data types and functions that aren’t presented in this book. 


Terms 
data type ASCII character 
character data type Unicode character 
string data type national character 
text data type fixed-length string 
numeric data type variable-length string 
integer precision 
floating-point number scale 
temporal data type scientific notation 
date/time data type single-precision number 
datetime data type double-precision number 
date data type casting 
large object (LOB) data type ranking functions 
rowid data type 
Exercises 


1. Write a SELECT statement that returns these columns from the Invoices table: 
The invoice_total column 


Use the TO_CHAR function to return the invoice_total column with 2 
digits to the right of the decimal point. 


Use the TO_CHAR function to return the invoice_total column with no 
digits to the right of the decimal point and no decimal point 


Use the CAST function to return the invoice_total column as an integer 
with seven digits 


2. Write a SELECT statement that returns these columns from the Invoices table: 
The invoice_date column 


Use the TO_CHAR function to return the invoice_date column with its full 
date and time including a four-digit year on a 24-hour clock 


3. 
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Use the TO_CHAR function to return the invoice_date column with its full 
date and time including a four-digit year on a 12-hour clock with an am/pm 
indicator 


Use the CAST function to return the invoice_date column as 
VARCHAR2(10) 


Write a SELECT statement that returns these columns from the Vendors table: 
The vendor_name column 
The vendor_name column in all capital letters 
The vendor_phone column 
The last four digits of each phone number 


When you get that working right, add the columns that follow to the result set. 
This is more difficult because these columns require the use of functions within 
functions. 


The second word in each vendor name if there is one; otherwise, blanks 


The vendor_phone column with the parts of the number separated by dots as 
in 555.555.5555 


Write a SELECT statement that returns these columns from the Invoices table: 
The invoice_number column 
The invoice_date column 
The invoice_date column plus 30 days 
The payment_date column 


A column named days_to_pay that shows the number of days between the 
invoice date and the payment date 


The number of the invoice_date’s month 
The four-digit year of the invoice_date 
The last day of the invoice date’s month 


When you’ve got this working, add a WHERE clause that retrieves just the 
invoices for the month of May based on the invoice date, not the number of the 
invoice month. 


Write a SELECT statement that returns these columns from the Invoices table: 
The invoice_number column 


The balance due (invoice total minus payment total minus credit total) with 
commas, a decimal point, and two decimal positions 


A column named “Balance Rank” that uses the RANK function to return a 
column that ranks the balance due in descending order. 
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